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SEQUENCE^ LISTING 




(1) GENERAL INFORMATION: 

(i) APPLICANT: Barry. Gerard 

Kishore, GanaSh M. 
Padgette, Sty^hen R. 
^ Stallings, Jf/illiam C. 

(ii) TITLE OF INVENTION: Glyphosace Tolerant 

5-Enolpyruvylshik/mate-3-Phosphate Synthases 
(iii) NUMBER OF SEQUENCES: 69 

(iv) CORRESPONDENCE ADDRESS- 

!b! S^t'^WI^^^^^^^ CO. BB4F 

(C) CITY: St. Zouis 

(D) STATE: Mis/sour^T 

(E) COUNTRY: AJSA 

(F) ZIP: 63l/8 / H 

(V) COMPUTER REiy&AB/^oy»/ ^' 

(A) MEDIUM A-YPETl-loppy disk 

(B) COMPU^feR: IBl^ PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/749 61 ^ 
•3) FILING DATE: :S-AUG-1991 
(C) CLASSIFICATION: 

(vii) PRIOR ^PLICATION DATA: 

(A) ABfPLICATION NUMBER: "JS 07/576,537 
(3) FILING DATE: 3i-AUG-1990 
•:C) QLASSIFICATION: 

(viii) ATTOR*EY/AGENT INFORMATION: 

;A) j^IAME: Hoerner Jr., Dennis R. 
:3) /REGISTRATION NUMBER: 3 0,914 
C) / REFERENCE /DOCKET -UMBER: ' 25-21 (i0660)A 

fix) TEL±:0MMUNICATXON INFORMATION: 
;a/ TELEPHONE: i 3 14 ) 537 -6099 ' 
.5* TELEFAX: 3 14 ) 5 3" -o047 
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(2) INFORMATION FOR SEQ ID NO 
(i) 



SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
TCATCAAAAT ATTTAGCAGC ATTCCAGATT CGGTTCAATC AACAAGGTAC GAGCCATATC 
ACTTTATTCA AA^CGTATC CCCAAAACCA AGAACGAACT CCCATCCTCA AAGGTTTGTA 
AGGAAGAATT CTCAGTCCAA AGCCTCAACA AGGTCAGGGT. ACAGAGTCTC CAAACCATTA 
CCCAAAAGCT ACAGGAGATC AATGAAGAAT CTTCAATCAA AGTAAACTAC TG^CCAGCA 
CATGCATCAT OGTCAGTAAG TTTCAGAAAA AGACATCCAC CGAAGAC^. ^orr.a.OO 
GCATC™ AAGTAATCTT GTCAACATCG AGCAGCTGGC TTGTGGGGAC CAGACAAAAA 

AGGAATGGTG CAGAATTGTT Arrrroa,-^-T, , 

AAITGTT AGGCGCACCT ACCAAAAGCA TCTTTGCCTT TATTGCAAAG 

ATAAAGCAGA TTCCTCTACT ACAACTGGGG AACAAAATAA CGTGGAAAAG AGCTGTCCTG 
ACAGCCCACT CACTAATGCG TATGACGAAC CCAGTGACGA CCACAAAAGA ATTCCCTCTA 

TATAAGAAGG CATTCATTCC CATTTPaai-o 

^^ii^^ LATTTGAAGG ATCATCAGAT ACTAACCAAT ATTTCTC 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1982 base pairs 
13) TYPE, nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



60 
120 
180 
240 
300 
360 

420 

480 

540 

597 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 62.. 1426 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
MGCCCCCCT TCTCTCqoCC CCTCCCCCCC CACACCCCTC CATACATTAA CCAAOACCCC 



10 15 



25 30 



^ I- z Z -I - s ?S Z 



40 45 



:S - - Z 51? T. Z ?S c=S ™ 

50 ^■'■y Ala Met 



55 60 



J- ^IS Z Z ?S 

70 75 



GAT GGC GTC GGC AAT GGC GGC C^C CTG GCr r-- r^n n^r^ 
ASP Gl. val Gl. Asn Gl. 01. .eu Su^ J- ^ A^^ 

85 90 95 

TTC GGC AAT GCC GCC ACG GGC TGC CGC C^G ACC ATr rn - ^-ro ^ 

Phe CI. .s„ Al. Al, ™. 01. C3 a4 Leu Zl Z III III 



105 no 



120 125 



CGC CCG ATG GGC CGC GTG TTG AAC CCG CTG CGC GAA ATG GGC GTG C^G 
.-.rg .ro Met c-.y Arq Val leu Asn Pro Leu Arg Glu Met C-ly Val Gin 



:0 



60 
106 



TCT GGC CTT TCC GGA ACC GTC CGC ATT CCr rrr n.r. 

Ser Gly Leu Ser Gly Thr Val Arg ill frl f,^ ?° ^CC 154 

20 ^-^^ Ser He Ser 



202 



250 



298 



346 



394 



Z "vr Ih' f ' '^"^ ^^•'^^ <^-^ ACA AAG 442 

/al ./r Asp Fhe Asp S^r Thr Phe lie Gly Asp Ala Ser leu T.hr Lvs , 

lis i^tn 



490 



• 1 c 
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GTG AAA TCG GAA GAC GGT GAG CGT CT- rr- r-n,^ 

v.. se. - 511 Jl ™ - - - 

^3 s= ?s lit - - r r - - 

160 Val Pro Met Ala Ser Ala Gin 



175 



JJs Sa^ 11^ Su^ - - AAG ACG CGG GGC A.C ACC 

180 ""^^ J-!" Gly He Thr 



?S G°?! ^^.^ J- - - CGC GAX CA. ACG GAA AAG A.C C.G 

. 195 Thr Glu Lys Met Leu 

205 

CAG GGC TTT GGC GCC AAG CTT ACC GTr rar 

Gin Gly Phe Gly Ala Asn Leu vTl Glu f ^ ^''^ 

210 Thr Asp Ala Asp Gly Val 

^-^^ 220 

z t\i it. r r - - - 

225 '^ly Leu Thr Gly Gin Val He 

^ 235 

GAC GTG CCG GGC GAC CCG TCr Trr> ^ 
ASP val Pro Gly Asp Pro sir Ser ^hr Al' H"" 

240 245 ^-^^ Leu Val Ala Ala 



245 

255 



CTG CTT GTT CCG GGC TCC GAC G-- Arr ^ 

Leu Leu val Pro Gly Ser Aso vli '^^^ 

260 " *• ft! Val Leu Met Asn 



265 



CCC ACC CGC ACC GGC CTC ATC r-r 

Pro Thr Arg Thr Gly Leu lie Leu f rf 

y Leu Thr Leu Gin Glu Met Gly Ala Asp 



285 



ATC GAA GTC ATC AAC CCr rrn /--.^ ^ 

lie Glu val He A^n p'o Aro ^'Z IT ''-^T "'^ 

290 -^1^ Gly Gly Giu Asp Val Ala Asp 

-'^ 300 



CTG CGC GTT CGC TCC -"cr ^r-r- r-^r, , 

Leu Arc Val Arc Ser ^er -^^r eu ^'"^ "-^^ 

305 ■ -^"^ --'^ ''^1 --^^ '^^1 Pro Glu Aso 



Z til IT^ S ^'i Z ^ i" - 

320 -''^ -° '-^ Leu Ala Val Ala Ala 

330 



538 



586 



634 



682 



730 



778- 



826 



874 



922 



970 



L018 



1066 



# 
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-I 5 S Z ei? „™ Z Z Zl Z IT, 

350 

vi? z z s lit z r - - 

g Leu ser Ala Val Ala Asn Gly Leu Lys Leu 

"n ?S 2: - - - - - C.C 0.C 1" CCC ccc 

370 ser Leu Val Val Arg Gly 

380 

I" z z IT r === 

385 ^ '^^^ "l" S« Oly Ala AU Val AU 

?S S^I If. - - - - - cxc 

400 ^-^^ Ser Phe Leu Val Met Gly Leu 

vi? z It: z rit z z r tf ™ 

420 ^ "^^^ lie Ala Thr 

"^^^ 430 

Sfr Z -I ^ Z Je- Z I- ^ -I Sy^ GGC G^CG AAG A. 

Tu llu ser rlr ^ ^^'^^^^^^^^ ^^^^-^^CC ATCGATGGTC 



450 ,55 



1114 



1162 



1210 



1258 



1306 



1354 



1402 



L456 



CCGCTGCGGC 


CGGCAAGGGG 


ACGCTCTCGC 


GCCGTATCGC 


GGAGGTCTAT 


GGCTTTCATC 


1516 


ATCTCGATAC 


GGGCCTGACC 


TATCGCGCCA 


CGGCCAAAGC 


GCTGCTCGAT 


CGCGGCCTGT 


1576 


CGCTTGATGA 


CGAGGCGGTT 


GCGGCCGATG 


TCGCCCGCAA 


TCTCGATCTT 


GCCGGGCTCG 


1636 


ACCGGTCGGT 


GCTGTCGGCC 


CATGCCATCG 


GCGAGGCGGC 


TTCGAAGATC 


GCGGTCATGC 


1696 


CCTCGGTGCG 


GCGGGCGC7G 


GTCGAGGCGC 


AGCGCAGCTT 


TGCGGCGCGT 


GAGCCGGGCA 


1756 


CGGTGCTGGA 


TGGAGGCGAT 


ATCGGCACGG 


TGGTCTGCCC 


GGATGCGCCG 


GTGAAGCTC7 


1316 


ATGTCACCGC 


37CACCGGAA 


OTGCGCGCGA 


AACGCCGCTA 


TGACGAAATC 


CTCGGCAATG 


1876 


GCGGGTTGGC 


CGATTACGGG 


ACGATCCTCG 


AGGATATCCG 


CCGCCGCGAC 


GAGCGGGACA 


1936 
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TGGGTCGGGC GGACAGTCCT TTGAAGCCCG CCGACGATGC 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 455 amino acids 

(B) TYPE: amino acid 
(D) •TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Ser His Gly Ala Ser Ser Arg P.o Ala Thr Ale 



GCACTT 



10 



La Arg Lys Ser Ser 
15 

CIV Leu se. ... r.r VaX xx. P„ ^^^^ 

ser „e. PHe OX. OX. Se. OX. oXu r.r ^ xxe ... 

" 45 

Gly Leu Leu Glu Gly Glu Asp Val He Asn Thr- n^ r 

50 cc ^'^^ Lys Ala Met Gin 

o 0 

AXa „e. OX. .X. ^^^^ 

80 



Gly val Gly Asn Gly Gly Leu Leu Ala Pro Glu Al 



85 



90 



Ala Pro Leu Asp Phe 



95 



Gly Asn Ala Ala Thr Gly Cvs Ara \ w 

100 ^ ^ ^al Gly Val 



110 



Tyr ASP Phe Asp Ser Thr Phe He Gly Asp Ala Ser Leu Thr 



120 



125 



Lys Arg 



Pro Met Gly Ara Val Leu Asn Pr-o r^.. a 

130 ' '^""^ ^'"^ ^^^"^ ^^^y ^'^1 Gin Val 

IJ 3 fin 



140 



Lys Ser Glu h 
145 



so Gly Asp Arg Leu Pro Val Thr Leu Arg Gly Pro Lys 

160 



Thr Fro Thr Pro He Thr Tvr Aro Val ^ 
165 



ro xet Ala Ser Ala Gin Val 



170 



175 



1982 
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III - 0.. „^ ^^^^ 

-1 

205 

Gly Phe Gly Ala Asn Leu Thr Val Gl„ . 

210 215 ""^P Asp Gly Val Arg 

220 ^ 

Thr He Arg Leu Glu Gly Aro n« r, , 

225 i 230 ""^^ Gin Val He Asp 

^•^^ 240 

Val Pro Gly Asp Pro Ser Ser Thr nu 

ser Thr Ala Phe Pro Leu Val Ala Ala Leu 

"^^^ 255 
- va. ..o ox, .sp vaz .k. ,3„ VaX 

270 

Thr Arg Thr Gly Leu lie Len Thy t 

275 "-^^ Glu Met Gly Ala Asp He 

285 



Olu val xle .s„ P„ .eu Ma 01. cl. .1. .^p Val Ma .eu 

'^^^ 300 
Ar. val ser Ser Thr .eu Gly Val Thr Val Pro aiu Asp Ar, 

Ala Pro ser Met lie Asp olu ^r Pro lie .eu Ala Val Ala Ala A^ 

330 

Phe Ala Clu oly Ala Thr Val Me. Asn oiy Leu Glu Glu Leu Arc Val 

Lys Glu ser Asp Arg Leu Ser Ala Val Ala Asn Gly Leu Lys Leu Asn 

' 360 265 

Gly yal ASP cys Asp Glu Gly Glu Thr Ser Leu Val Val Arg Gly Arg 

Pro ASP Gly Lys Gly Leu Gly Asn Ala Ser Gly Ala Ala Val Ala Thr 

395 



His Leu Asp His Arg lie Ala Met S 



er =he Leu Val Met Gly Leu Val 



-^05 410 415 

ser Glu Asn Pro Val Thr Var Asp Asp Ala Thr Met lie Ala Thr Ser 

-125 430 
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- - „e. ... 0., 

445 

Leu Ser Asp Thr Lys Ala Ala 

455 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i.) SEQUENCE CHARACTERISTICS- 

(A) LENGTH : 1673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 86.. 1432 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 
OTAGCCACAC ATAATTACTA TAGCTAGGAA GCCCGCTATC TCTCAATCCC GCGTCATCGC 
OCCAAAATGT GACTGTGAAA AATCC ATG TCC CAT TCT GCA TCC CCG AAA CCA 

Met ser His Ser Ala Ser Pro Lys Pro 



^ "II Til Tra Trl Ser cT. Sa l'' <^-<^ 

10 " • ,1 "^^^ Thr Gly Glu He Arg He Pro 

20 

Sy ASP IT. ser Hil a"' T' IT ^^-^ 

r .le Ser His Arg Ser Phe Met Phe Gly Gly Leu Ala 

35 40 

ser C?y Tu ^T If, f J ^^A GGC GAG GAC GTC ..TC 

Arg ..e T.hr Oxy Leu Leu Glu Gly Glu Aso Val lie 
50 55 

..sn^rh. ..y Arg Ala Met_Gln Ala Met Gly Ala Lys He ..rg Lys Glu 



60 
112 

160 

208 

256 



3 52 



GGC GAT GTC TGG ATC ATC .AAC GGC GTC GGC AAT GGC 7GC C^G cxr 
.iy ASP val rrp He He Asn Gly Val Gly Asn ofy Cy1 l;' LT. Gin 
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^° 85 
CCC GAA GCT GCG CTC GAT TTr rrn aao, ^ 

.J - - =CC OCO „c 



105 



120 



Sa^ S Trt - GTG CTC ..C CCC TTC 

Lys Arg Pro Met Gly Arg Val Leu Asn Pro Leu 



130 



z z z vii ir. IT. r it -° -g 

140 ""^^ Ala Asp Gly Asp Arg Met Pro 



150 



I- ;s - - 



-^0 ::65 



Gly Ser Asp Val Thr 
250 

CGC .^AC GTG CTG ATG A.AC CCG ACC CGT ACC GGC CTC ATC CTC ^CC ^G 

Arg Asn Val Leu Met Asn Pro Thr Arg Thr Gly Leu He Leu Thr leu 



400 



ACC ATG GGC CTT GTC GGC ACC TAT ri^n . 

M« .eu V. z 21 III iz J- - 



496 



544 



592 



640 



lit Tu ITr IT. ^ ITr Z T T T 
170 ^■JK Gly Leu 

180 

AAC ACG CCG GGC GTC ACC ACC GTC ATC GAG CCG GTC ATG ACC CGC GAC 
Asn Thr Pro Gly Val Thr Thr Val He Glu Pro Val M^t ^hr Arg A^o 

195 200 ' 

CAC ACC GAA AAG ATG CTG CAG GGC TTT GGC GCC GAC CTC ACG GTC G>G 
Hxs Thr Glu Lys Met Leu Cln Gly Phe Gly Ala Asp l;' Jhr vTl cfu 
-05 210 215 

^hr A^n f ^ '''''' ''^^ -"^^ -^CC GGC CAG GGC .^G 

Thr ASP Lys Asp Gly Val Arg His He Arg He Thr Gly Gin Gly Lys 

225 230 

CTT GTC GGC CAG ACC ATC GAC GTG CCG GGC GAT CCG TCA TCG ACC GCC 
.eu val Gly Gin Thr He Asp Val Pro Gly Asp Pro Ser Ser Thr Ala 

240 245 

TTC CCG CTC GTT GCC GCC CTT CTG GTG GAA GGT TCC GAC GTC ACC ATC 880 
.he Pro Leu Val Ala Ala Leu Leu Val Glu Gly Ser Asp Val Thr Zle 



688 



736 



784 



832 



928 
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270 



275 



280 



CAG GAA ATG GGC GCC GAT ATC GAA GTG CTC AAT GCC CGT CTT GCA GGC 
Gin Glu Met Gly Ala Asp He Glu Val Leu Asn Ala Arg lI^ aS Ty 



295 



GGC GAA GAC GTC GCC GAT CTG CGC GTC AGG GCT TCG AAG CTC AAG GGC 
Gly Glu ASP val Ala Asp Leu Arg Val Arg Ala Ser Lys Leu Ty 



310 



350 -,cr 



GCA CGC GGC CTT GAA GCC AAC GGC GTC GAT TGC ACC GAA GGC GAG ATG 
Ala Arg Gly Leu Glu Ala Asn Gly Val Asp Cys Thr Glu G^y G^u Met 
■^65 370 



TCG CTG ACG GTT CGC GGC CGC CCC GAC GGC AAG GGA CTG GGr rrr nr^n 
ser Leu Thr Val Arg Gly Arg Pro Asp Gly LyJ G^y lIu c'^y G^y ^y 

385 

Thr val Ala ihr His i.eu Asp His Arg He Ala Met Ser Phe Leu Val 

400 

ATG GGC CTT GCG GCG GAA AAG CCG GTG ACG GTT GAC GAC AGT AAC ATG ' 
Me Gly Leu Ala Ala Glu Lys Pro Val Thr Val Asp Asp Ser ^n M^t 

■•IS 420 

ill Af ^l"" -'^^ -^^^ CCG GGA TTG GGC 

He Ala Thr Ser Phe Pro Glu Phe Met Asp Mec Met Pro Gly Leu Gly 

435 440 

GCA AAG ATC GAG TTG AGC ATA CTC 7AGTCACTCG "aCAGCGAAAA TATTATTTGC 
Ala Lys i_e Glu Leu Ser He Leu 
445 

GAGATTGGGC ATTATTACCG GTTGGTCTCA GCGGGGGTTT .^ATGTCCAAT CTTCCATACG 



976 



1024 



vTl 51? f T f ° "^^^ ATG ATC GAC GAA TAT CCG 

val val val Pro Pro Glu Arg Ala Pro Ser Met He Asp Zl S 

320 325 

GTC CTG GCG ATT GCC GCC TCC TTC GCG GAA GGC GAA ACC GTG ATG GAC 
V Leu Ala Ue Ala Ala Ser Phe Ala Glu Gly Glu Thr VaJ Met Isl 

340 345 

GGG CTC GAC GAA CTG CGC GTC AAG GAA TCG GAT CGT C-r m nnn ^ 
Gly Leu ASP Glu Leu Arg Val Lys Glu Ser Asp' Ar^ lIu ^a Sa Z '''' 



1072 



1120 



1216 



1264 



1312 



1360 



1408 



1462 



1522 
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TAGAAGGAAT TGCTAGAGCA GCGACGCCGC 1582 
TACTGAAATC CCGGGGGGTC CGGGGATCAA l 



A 



TAACAGCATC AGGAAATATC AAAAAAGCTT 
CTAAGCTTTC TCAAGACTTC GTTAAAACTG 
ATGACTTCAT TTCTGAGAAA TTGGCCTCGC 

■ (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQU^CE CHARACTERISTICS. 

(A) LENGTH: 449 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Met Ser His SerAla Ser Pro Lys Pro Ala Thr ai . 

1 5 Arg Arg Ser Git 

10 ^5 

Aa. Leo TH. 0.. Ue x.e P„ 

A., se. PKe .He 01. 01. .eu .1. se. ol. OX. TH. ^ xx, .h. 

4U 45 

Gly Leu Leu Glu Gly Glu Asp Val He Asn Th,- ri, ^ 

50 ^ aj. ixe Asn Thr Gly Arg Ala Met Gin 

^= 60 

„e. OX. .X. ^, 

• - 80 
=lv vax 01. .3„ OX. C.S Leu .eu 0X„ P„ OXu AXe AX, Leu Asp Phe 

90 55 

Gly Asn Ala Glv Thr r'w ai= » 

a vjiy xnr Oxy Ala Arg Leu Thr Mf»r r'-/ r -., i- - 

^00 -'^^ Gly Thr 

110 

Tyr Asp Met lys Thr Ser Phe He riv, a=« - 

115 -^^P -^^^ -e-^ Ser Lys Arg 

Pro Met Glv A^^a - > 

.^^ . A.g .al Asn Pro Leu Arg Glu Met Gly Val Gin Val 

140 



642 



1673 
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Glu Ala Ala Asp Glv Asn Ar-r^ r, 

145 ^ ^5^^ Pro Leu Thr Leu He Gly Pro Lys 

160 



Thr Ala Asn Pro He Thr- iv,,- » , 

Xle Thr Tyr Arg Val Pro Met Ala Ser Ala Gin Val 

" 175 

Lys Ser Ala Val Leu Leu Ala ri„ r 

180 Thr Pro Gly Val Thr Thr 

i 190 
val Xle OU Me. T.. .^p „.3 T.. =x„ 

205 

Gly Phe Gly Ala Asp Leu Thr Val n„-Pv, . 

210 '^^^ Asp Lys Asp Gly Val Arg 

220 

His lie ne Th. Cl. 01„ 01. Val 01. al„ ... x,, 

240 

Val Pro Gly Asp Pro Ser Ser Th,- ai r,w 

245 ^ ""'^ Ala Ala Leu 

255 

Leu Val Glu Gly Ser Asd Val th^ n , 

260 "^""^ Leu Met Asn Pro 

270 

Thr Arg Thr Gly Leu He L*.ii tHt- r 

y Leu Thr Leu Gin Glu Met Gly Ala Asp He 

'^^O 285 

Glu Val Leu Asn Ala Ara Leu Ala rUr m,, 

290 "oc ^ Asp Val Ala Asp Leu 



=95 .00 



Arg val Arg Ala Ser L.s Leu Lys Gly Val Val Val Pro Pro Glu Arg 

320 

Ala Pr= ser Met lie Asp olu T.r P.-, val .eu Ala He Ala Ala Ser ^ 



330 



335 



Phe Ala Glu Gly Glu Thr Val Met Asp Gly Leu Asp Glu leu Ara Val 

345 250 

■^ys Glu ser Asp Arg Leu Ala Ala Val Ala Arg Gly Leu Glu Ala Asn 

-^SO 365 

•Uy Vai Asp Cys Thr Glu Oly Giu Met ie- " eu '-^^ '^al • r',, > 

-7 Q ^ -cu X... vai .nra Giy Arg 



380 



Pro Asp Gly Lys Gly Leu Glv Glv Glv '-^.i ^ - -u^ . 

235 - — "-a -^-r -13 :.eu Asp 



295 



400 
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His A., n. Ma se. P.e .eu V,. „^ ^^^^ 

V,. vaj .3p .sp se. .s„ „« ne Ma ^./se. Z ox. 

430 

Phe Met Asp Met Met Pro Gly Leu Gly Ala rv« n 

435 He Glu Leu Ser He 



440 



Leu 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1500 base pair« 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 34.. 1380 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GTGATCGCGC CAAAATGTGA CTGTGAAAAA TCC ATG TCC CAT TCT GCA TCC CCG 

Met Ser His Ser Ala Ser ^ro 

1 5 

^ Pro Til Tl f ^ f " GAA ATC CGC 

Lys Pro Ala T.hr Ala Arg Arg Ser Glu Ala Leu Thr Gly Glu He Arg 

^5 20 

ATT CCG GGC GAC AAG TCC ATC TCG CAT CGC TCCTTC ATG TTT GGC GGT 
He Pro Gly Asp Lys Ser He Ser His Arg Ser P.he Met Phe Gly Gly 
25 30 35 

CTC GCA TCG GGC GAA ACC CGC ATC .^CC GGC CTT CTG GAA GGC GAG GAC 
.eu Ala ser Glv Glu Thr Arg He T.^-.r Gly leu Leu Glu Gly Glu Asp 

::0 55 



54 



102 



150 



198 
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vIJ z ?s - s - z s ^ - - 



" 70 



80 85 



S ± s= s= - s 



100 



iJO 135 



™ SI ^ =^ - ir. Z Z Z Z Z 

jl - - - - - - - z 

ISO 

S: 51? ^ SI? .S S X - I- - ?I? - - - 



GGT CTC AAC ACG CCG GGC GTC ACC ACC GTC ATC GAG CCG GTC ATG ACC 
Gly Leu Asn Thr Pro Gly Val Thr Thr Val :ie Glu Pro Val Met ^hr 

190 T^95 

CGC GAC CAC ACC GAA AAG ATG CTG CAG GGC TTT GGC GCC GAC CTC ACG 
Arg ASP His Thr Glu Lys Met Leu Gin Gly Phe Gly Ala Asp lIu T^r 
■ 205 210 215 

vll gTu i^r T"" -^^C GGC CAG 

-/al Glu Thr Asp Lys Asp Gly Val Arg His He Ara lie Thr Gly Gin 

225 230 

?iv "^^ "-'^ 'C'^ ^CG 

Gly Lys Leu .ai Gly Gin Thr He Asp Val Pro Gly . Asp Pro Ser Ser 

240 245 



246 



294 



342 



CGC CTC ACC ATG GGC CTT GTr rrr a,,™ 

Ar, Leu Thr Met oly Z Z Z ^hr 3 Z H« f ' "° 

105 ^ ^ "^'^ T^J^ Ser Phe 

110 ij_5 



48& 



534 



582 



630 



678 



726 



774 



• # 
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ACC GCC TTC CCG CTC GTT GCC rrr r-^ ^ 

- - - J. 

"260 



ACC ATC CGC AAC GTG CTG ATP sao 

Thr lie Arg Asn Va^ lIu nit Pro JJ'' f'''' "^"^ ^TC 

265 270 ''''^ lie Leu 



ACC TTG CAG GAA AfG GGC GCC GAT ATC r^A p-p^ 

Thr Leu Gin Glu Met Gly Ala Asn n CTT 
280 ^ ""^^ Asn Ala Arg Leu 



295 



GCA GGC GGC GAA GAC GTC GCC GAT crr rnr^ 

Al^ Oly Gly cm A=p val a" ^ =f f = CTC 

300 y Arg /al «rg Ala Ser Lys Leu 



310 



340 



11^ z LI r r °- 

345 Asp Arg Leu Ala 

355 

GCG GTC GCA CGC GGC CTT GAA GCC .:^AC GGC G^C G'T -r ^ r-.. 
.la V.1 .la .rg Oly .eu Olu .la Asn IZ ^ ^ 

I- ;s ?ii 1% z =5 S 

385 390 

r?v r?"" ^''T '^AT CGT ATC GCG ATG AGC TTC 

Gly Gly Thr Val Ala Thr His Leu Asp His Arg He Ala Met Ser III 

400 405 

■lu z ''a A^a '''''' ^- 

I. t ely ^eu ^.a Axa Oxu .ys Pro Val Thr Val Asd Asd Ser 
•^1^ 415 420 

^ mIc aS Th? le^ f^l f ^ 

j,c " '"^ ^-^^ Met Asp Met Met Pro Gly 

Jin ^ 



822 



870 



918 



966 



1014 



AAG GGC GTC GTC GTT CCn rrr rj,^ ^ 

Lys Gly val Val 511 Pro Pro oTu Ara A^^ f ^ 

Pro Glu Arg Ala Pro Ser Met He Asp Glu 

325 

TAT CCG GTC CTG GCG ATT GCC GCC TCC TTC GCr r^^ rrr^ . 

Tyr Pro Val Leu Ala He Ala Ala S^r Phe 111 ^'^^ 



1110 



1158 



1206 



1254 



L302 



1350 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 449 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met ser His Ser Ala Ser Pro Lys Pro Ala Thr Ala Ar. Ar. Ser Clu 

10 . 15 

Ala Leu Thr Gl. Glu lie Arg He Pro Gly Asp Lys Ser He Ser His 

25 30 
Ar, s„ Phe „et Phe Gly Oly Leu ,U Se. Gly Glu Thr Ar, lU Thr 

4 5 

Gly Leu Leu Glu Gly Glu Asp Val lie Asn ^hr G'v ai » 

:q * ^-^y •■'-9 -^^3 Met Gin 

Ala Met Gly Ala Lys lie Arg Lys Glu Gly Asp Val Trp He He Asn 

'5 80 
Gly val Gly Asn Gly Cys Leu Leu Gin Pro Glu Ala Ala Leu Aso Phe 

90 95 

Gly Asn Ala Gly Thr Gly Ala Arg Leu Thr Met Gly Leu Val Gly Thr 

Tyr Asp Met :^ys Thr J 

Pro Met Gly Arg Val Leu Asn Pro Leu Arg Glu Met Gly Val Gin Val 

140 



105 

.^er Phe lie Gly Asp Ala Ser Leu £er Lys Arg 
i20 - -iq 



1400 



™ gI^ s= If. S Z 

440 

TATTATTTCC GAOATTCOCC A^ATTACCC CTTCCTCTCA GCGCCCC^ ^TCTCCAAT 14S0 
CTTCCATACG TAACAGCATC AGGAAATATC AAAAAAGCTT 



1500 
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Clu Ma AXa Asp CI. Asp p., 

160 

Th. P.O ne Thr 



170 „5 



LVS se. M» ... ... 

•^^5 1 on 



190 

V.1 ne Olu P„ V* Met Thr Ar, Asp His Thr olu ..a Met Leu Oln 

205 

Oly Phe 01. AU ASP .eu Thr V.l 01. Asp .,s Asp 01, val Ar, 

'^^^ 220 
His ne Ar, tie TKr 01. Oln 01. ..s .eu Val 01. 01„ Thr He Asp 

vel Pro 01. ASP Pro Ser Ser Thr Ale Phe Pre .eu Val Ala Ala Z 

250 255 
Leu val Olu 01. ser Asp Val Thr He Ar, As„ Val Leu Met Asn Pro 

265 270 

Thr Arg Thr Gly Leu He Leu Thr Leu Gin Glu Met Gly Ala Asp He 

280 285 

Glu val Leu Asn Ala Arg Leu Ala Gly Gly Glu Asp Val Ala Asp Leu 



295 



300 



Arg val Arg Ala Ser Lys Leu Lys Gly Val Val Val Pro Pro Glu Arg 



315 



320 



Ala Pro ser Met lie Asp Glu Tyr Pro Val Leu Ala He Ala Ala Ser 

330 

Phe Ala Glu Gly Glu Thr Vai Met Asp Gly Leu Aso Glu Leu Arg Val 

345 

Lys Glu ser Asp Arg Leu Ala Ala Val Ala Arg Gly Leu Glu Ala Asn 

Gly val ASP Cys Thr Glu Gly Glu Met Ser Leu Thr Val Arg Gly Arg 

-■5 3 90 

Pro ASP Gly Lys Gly Leu Gly Gly Gly Thr Val Ala Thr His Leu Asp 

395 400 



360 365 
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Kis X.e s„ ... 

415 



V,. ... vax ..p 3« „ „e „a TH. Se. PHe ..o 

430 

Phe Met Asp Met Met Pro Gly Leu Gly Ala Lv, n ^i 

435 ^-^^ He Glu Leu Ser He 

445 

Leu 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: prote 



in 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ser Leu Thr Leu Gin Pro He Ala Arg Val Asp Gly Thr He Asn Leu 

10 15 

Pro Gly Ser Lys Thr 1 c^v -v 

^ys i..r /al ^er Asn Arg Aia Leu Leu Leu Ala Ala Leu 

Ala His Gly Lys Thr Val Leu Thr Asn Leu Leu Asp Ser Asp Asp Val 

40 45 

Arg His Met Leu Asn Ala Leu Thr Ala Leu Gly Val Ser Tyr Thr Leu 

55 60 

ser Ala Asp Arg Thr Arg Cys Glu He He Gly Asn Gly Gly Pro Leu 

■° '5 80 

His Ala Glu Gly Ala Leu Glu Leu Phe Leu Gly Asn Ala Gly Thr Ala 



?0 9 5 



Met Arg Fro Leu Ala Ala A 

100 -nc 

-05 2^i^o 



a Leu Cys Leu Gly Ser Asn Asp He Val 
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Leu Thr ex. Clu P„ Me. ..3 clu ne „,3 V.X 

125 

ASP Ala Leu Arg Leu Gly 01. Ala t.s He Thr Leu Olu cl„ 

^■•^ 140 
As„ ,vr ..o P.O Leu Ar, Leu Oln 01, oi, .he Thr 01, 01, as„ val 
i 160 
ASP val A3P 01, ser val Ser Se, oln PHe Leu Ala Leu Leu Met 



170 



Ala Leu Ala oiu Asp val lie Ar, lie L,3 01, Asp 

190 



Leu val se, L,s P„ lie Asp lie THr Leu As„ Leu «e. L,s TH. 

^UO 205 
Phe 01, val Olu ne olu As„ oln „1. ol„ oln Phe Val Val L,s 

220 

01, 01, 01„ se. oln se. P.„ ol, THr Leu Val olu 01, .^p 

Ala ser se. Ala s« T,. Phe Leu Ala Ala Ala Ala lie L,s 01, ol^ 

250 255 
Th. v.l L,s val Th. 01, „e 01, A„ As„ Se. Met 01„ 01, Asp lie 

263 270 

Arg Fhe Ala Asp Val Leu Glu Lvs Met Gly Ala T-r Tlo r 

275 ^ ^"^ Cys Trp Gly 

280 285 

ASP ASP ne Ser c,s Th. A„ 01, olu Leu Asn Ala lie Asp Met 

^9= 300 

ASP Mec Asn His lie Pro Asp Ala Ala Mec Thr rie Ala Thr Ala Ala 

320 

Leu Fhe Ala Lys Gly Thr Thr Arg Leu Arg Asn He Tyr Asn Tro Arg 
^25 330 335 

val lys Glu Thr Asp Arg Leu Fhe Ala Met Ala Thr Glu Leu Arg Lvs 

^-5 350 

Val Gly Ala Glu Val Giu c-. ' ■,(-•<• ui ^ • 

.ai L.u ._..a oxv his .^sp Tyr ..e Arg He Thr Fro 

^=•5 360 355 
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Pro Glu Lys Leu Asn Phe Ala ri„ n 

370 lie Ala Thr Tyr Asn Asp His Arg 

^ 380 ^ 

Met Ala Met Cys Phe S^r r^., ^, ■, 

385 Ala Leu Ser Asp Thr Pro Val Thr 

400 

lie Leu ASP Pro Lys Cys Thr Ala Lys Thr Phe Pro Asp TVr 



410 - 

415 



Gin Leu Ala Arg He Ser Gin 
420 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCATGGCTCA CGGTGCAAGC AGCCGTCCAG CAACTGCTCG TAAGTCCTCT GGTCTTTCTG 
GAACCGTCCG TATTCCACCT C-ACAAGTCTA TCTCCCACAG CrCCr.CATG TTTGGAGGTC 
TCGCTAGCGG TCAAACTCGT ATCACCGGTC TTTTGGAAGG TGAAGATGTT ATCAACACTG 
GTAAGGCTAT GCAAGCTATG GGTGCCAGAA TCCGTAAGGA AGGTGATACT TGGATCATTG. 
ATGGTGTTGG T.^CGGTGGA CTCCTTGCTC CTGAGGCTCC TCTCGATTTC GGTAACGCTG 
CAACTGGTTG CCGTTTGACT ATGGGTCTTG TTGGTGTTTA CGATTTCGAT AGCACTTTCA 
TTGGTGACGC TTCTCTCACT ..AGCGTCCAA TGGGTCGTGT C-TTGAACCCA CTTCGCGAAA 
TGGGTGTGCA GGTGAAGTCT GAAGACGGTG ATCGTCTTCC AGTTACCTTG CGTGGACCAA 
AGACTCC.^C GCCAATCACC 7ACAGGGTAC CTATGGCTTC C3CTC.=.:.GTG AAGTCCGCTG 
TTCTGCTTGC TGGTCTCAAC ACCCCAGGTA TCACCACTGT TATCGAGCCA ATCATGACTC 
GTGACCACAC TGAAAAGATG CTTCAAGGTT rTGGTGCTAA CCTTACCGTT GAGACTGATG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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cxcccc^r ccc..cc.«: ^^^^ 

ATOTTCCOC XO.TCCATCC .CT.C^CTT TCCC„TCOX T.C^CCTTC CXT=„c«0 
OTTCC0.CCT CACC^CCTT ^^^^^ 

CT™* ..CXCCMC .COTCrT^ 

ACTCOn.. c™c«,,„ ^^^^ 

CTCCTCCTTC T.^c O.o™,.c« TTCTCOCTO. ..CCCTCC. ..a=CT.«. 
OTOCT.CCOT t™„cO=. ™c«o«C TCC^™ OC^OCO.C COTCTTT™ 

cT=TC=c«. caarcTc«a ctcmcco^ ™.ttocoa «=«c=T0.a .cttcctcc 

TCOTOCCTOO TCG^CTOAC OCT..«CTC TCC=CT«COC ^.O.A.CA CC«=^,cT. 
CCCACC^C. TCACCGTATC OCTATOAGcT .CCTCC^T GOOTCTCGTr TCTG««CC 
CTGTTACOT TOATCATGCT ACTATGATCO CTACTACCTT CCCAOAGTTC ATCCATrTGA 
TGGCTGOTCT TGGACCTAAO ATCGAACTCT CCGACACTAA CGCTGCTTCA TGAGCTC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: S7..317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGATCTATCG ATAAGCTTGA TGTAATTGGA GGAAGATC.^A AATTTTC^AT CCCCATTCTT 

CGATTGCTTC AATTGAAGTT TCTCCG ATG GCG CAA GTT AGC AGA ATC TGC AAT 

Met Ala Gin Val Ser Arg lie Cys Asn 



720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1377 



60 
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GGT GTG CAG AAC CCA TCT CTT ATC TCC AAT CTC TCG AAA TCC AGT CAA 
G y Val Gin Asn Pro Ser Leu He Ser Asn Leu Ser ^ Ser sTr ^oTn 

20 25 

CGC AAA TCT CCC TTA TCG GTT TCT CTG AAG ACG CAG CAG CAT CCA CGA 
Arg Lys Ser Pro Leu Ser Val Ser Leu Lys Thr Gin Gin His Pro A« 

35 40 

GCT TAT CCG ATT TCG TCG TCG TGG GGA TTG AAG AAG AGT GGG ATG ACG 
Ala Tyr Pro lie Ser Ser Ser Trp Gly Leu Lys Lys Ser Gly nit ^hr 
45 50 55 

TTA ATT GGC TCT GAG CTT CGT OCT CTT AAG GTC ATG TCT TCT GTT TCC 

3er 
70 



' -^-^ AACi GTC ATG TCT TCT GTT TCP 

Leu lie Gly Ser Glu Leu Arg Pro Leu Lys Val Met Ser Ser vll Ser 



161 



209 



257 



305 



ACG GCG TGC ATG C 

Thr Ala Cys Met 318 
75 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Ala Gin Val Ser Arg He Cys Asn Gly Val Gin Asn Pro Ser Leu 

He Ser Asn Leu Ser Lys Ser Ser Gin Arg Lys Ser Pro Leu Ser Val 
20 25 30 

Ser Leu Lys Thr Gin Gin His Pro Arc Ala Tvr Pro He Ser Ser ^er 
35 40 45 

Trp Gly Leu Lys Lys Ser Gly Met Thr Leu lie Gly Ser Glu Leu Arg 
50 55 so 

Pro Leu Lys Val Met Ser Ser Val Ser Thr Ala Cvs Met 
65 -c " 



38-21(10660)A 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE^ TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 87.. 401 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AGATCTATCG ATAAGCTTGA TGTAATTGGA GGAAGATCAA AATTTTCAAT CCCCATTCTT 

CGATTGCTTC AATTGAAGTT TCTCCG ATG GCG CAA GTT AGC AGA ATC TGC AAT 

Met Ala Gin Val Ser Arg He Cys Asn 

1 5 

GGT GTG CAG AAC CCA TCT CTT ATC TCC AAT CTC TCG AAA TCC AGT CAA 
G y val Gin Asn Pro Ser Leu He Ser Asn Leu Ser Lys Ser Ser oTn 

20 25 

CGC AAA TCT CCC TTA TCG GTT TCT CTG AAG ACG CAG CAG CAT CCA CGA 
Arg Lys Ser Pro Leu Ser Val Ser Leu Lys T.'.r Gin Gin His Pro Trg 
^° 40 



GCT TAT CCG ATT TCG TCG TCG TGG GGA TTG .^:.G AAG AGT GGG ATG ACG 
Ala Tyr Pro xle Ser Ser Ser Trp Gly Leu Lys Lys Ser Gly Met Thr 
45 50 55 

TTA ATT GGC TCT GAG CTT CGT CCT CTT AAG GTC ATG TCT TCT GTT TCC 
Leu lie Gly Ser Glu Leu Arg Pro Leu Lys Val Met Ser Ser Val Ser 
=° o5 70 



rnr. "''^ ^"^^ ^"^^ ^-^^ CCC ATT AGA GAA ATC 

Thr Ala Gxu :.ys Ala Ser Glu He Val Leu Gin Pro lie Arg Glu He 

75 SO 35 

TCC GGT CTT ATT .^AG TTG CCT GGC TCC .:^G TCT CTA TCA AAT AGA ATT 

.er G.y Leu Z.e Lys Leu Pro Gly Ser lys Ser Leu Ser Asn Arg lie 

-00 105 

c 



60 
113 

161 

209 

257 

305 

353 

401 

402 
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(2) INFORMATION FOR SEQ ID NO: 13.- 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi, SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Ala Gin Val Ser Aro n 

I Arg lU cy. Asn Oiy v,l cl„ Asn P„ ser .eu 

ne se. J= 

s« - ™^ P. 

45 

Trp Gly Leu Lys Lys Ser Gly Met Th. r 

50 55 "^"^^ Gly ser Glu Leu Arg 

60 

Pro Leu Lys Val Met S*»r- ,r i 

ser ser Val Ser Thr Ala Glu Lys Ala Ser Glu 

80 

He val Leu Gin Pro He Arg Glu He Ser Glv r 

85 lie Lys Leu Pro 

Gly Ser Lys Ser Leu Ser Asn Arg He 

105 

(2) INFORMATION FOR SEQ ID NO: 14: 



95 



(i) SEQUENCE CHARACTERISTICS- 

>A) LENGTH: 233 base oairs 
■3) TYPE: nucleic acid 
''O STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE, (genomic) 



(IX) FEATURE: 

■A) MAME/KEY: CDS 

■3) LOCATION: 1-J..232 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO-14- 



5 10 



20 25 



iTr ^11 z III en iz iTr r r ^ 

30 ""^^ Lys Asn Ser Ala Asn 



35 40 



■4 



40 



45 



Leu Lys Lys Asp Ser :ie Phe Met Gin Lys Phe Cys Ser =he Arg He 

55 50 



97 



145 



193 



TCT ATG TTG GTT TTG AAA AAA GAT TCA ATT TTT ATG TAA :.^n 
Ser Met Leu Val Leu Lys Lys Asn t7 nu ^ ^^'^ '^^'^ 

45 50 ^ ^^"^ Met Gin Lys Phe Cys 

55 60 
TCC TTT AGG ATT TCA GCA TCA GTG GCT ACA GCC TGC ATG C 
ser Phe Arg He Ser Ala Ser Val Ala Thr aS nit 
^5 70 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Ala Gin He Asn Asn Met Ala Gin Gly He Gin Thr Leu Asn Pro 

5 10 15 

Asn ser Asn Phe His Lys Pro Gin Val Pro Lys Ser Ser Ser Phe Leu 
-0 25 30 

val Phe Gly Ser Lys Lys Leu Lys Asn Ser Ala Asn Ser Met Leu Val 
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Ser Ala Ser Val Ala Thr Ala Cys Met 
^5 70 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 49.. 351 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



AGATCTGCTA GAAATAATTT TGTTTAACTT Taar's,^oo»^ 

n^iiiAACTT TAAGAAGGAG ATATATCC ATG GCA CAA 

Met Ala Gin 

1 

ATT AAC AAC ATG GCT CAA GGG ATA CAA ACC CTT AAT CCC AAT TCC AAT 
lie Asn Asn Met Ala Gin Gly He Gin Thr Leu Asn Pro ^n Ser ^n 

10 2.5 

TTC CAT AAA CCC CAA GTT CCT .^AA TCT TCA AGT TTT C-- G-" --t g^A 
Phe H.s Lys Pro Gin Val Pro Lys Ser Ser Ser Phe Val ;he G^y 

25 30 35 

TCT AAA AAA CTG AAA AAT TCA GCA AAT TCT ATG TTG GTT TTG AAA AAA 
ser Lys Lys Leu Lys Asn Ser Ala Asn Ser Met Leu Val Leu ^ ^ 

45 50 

GAT TCA ATT TTT ATG CAA AAG TTT TGT TCC TTT AGG ATT TCA GCA TCA 
ASP ser He Phe Met Gin Lys Phe Cys Ser Phe Arg He Ser Ma Ser 
=5 60 65 

vll Til Jhr IT ''"^ -^^ 

/al Ala Thr Ala Gin Lys Pro Ser Glu He Val Leu Gin Pro He Lys 

"^5 30 

GAG ATT TCA GGC ACT GTT TTG CCT GGC TCT AAA TCA TTA TCT .:^T 

Glu lie .er Gly Thr Val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn 

PO 95 



57 



105 



153 



201 



249 



.97 



345 
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AGA ATT C 

Arg He 352 
100 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) Length : lOl amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala cm lie Asn Asn Met Ala Gin Gly He Gin Thr Leu Asn Pro 

10 15 

Asn Ser Asn Phe His Lys Pro Gin Val Pro Lys. Ser Ser Ser Phe Leu 

25 30 

val Phe Gly Ser Lys Lys Leu Lys Asn Ser Ala Asn Ser Met Leu Val 

40 45 

Leu Lys Lys Asp Ser lie Phe Met Gin Lys Phe Cys Ser Phe Arg He 

Ser Ala Ser Val Ala Thr Ala Gin Lys Pro Ser Glu He Val Leu Gin 

^5 30 

Pro He Lys Glu He Ser Gly Thr Val Lys Leu Pro Gly Ser Lys Ser 

90 95 

Leu Ser Asn Arg He 
100 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

lA) LENGTH: 28 amino acids 
i3) TYPE: amino acid 
C) STRANDEDNESS: single 
O) TOPOLOGY: linear 

;ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Xaa His 01. Ala Ser Se. p.o .la Thr Ala A.. .,3 Ser Ser Cly 

10 

Leu xaa Gly Thr Val Arg He Pro Gly Asp Lys Met 

25 

2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ala Pro ser Met lie Asp Glu Tyr Pro He Leu Ala Val 
^ 10 
) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino. acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

lie Thr Gly Leu Leu Glu Gly Glu Asp Val He Asn Thr Gly Lys 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE; TYPE: Other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ATGATHGAYG ARTAYCC 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 



(xi) SEQUENCE DESCRIPTION: £EQ ID NO: 22: 
GARGAYGTNA THAACAC 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic 



(xi) SEQUENCE DESCRIPTION: £EQ ID NO:23: 



(A) DESCRIPTION: Synthetic DNA 



17 



(A) DESCRIPTION: Synthetic DNA 



17 



(A) DESCRIPTION: Synthetic DNA 



GARGAYGTNA 7HAATAC 



17 
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(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CGTGGATAGA TCTAGGAAGA CAACCATGGC TCACGGTC 

38 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGATAGATTA AGGAAGACGC GCATGCTTCA CGGTGCAAGC AGCC 44 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GGCTGCCTGA TGAGCTCCAC AATCGCCATC GATGG 



32 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE W: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CGTCGCTCGT CGTGCGTGGC CGCCCTGACG GC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGGGCAAGGC CATGCAGGCT ATGGGCGCC 

29 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other r.ucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
CGGGCTGCCG CCTGACTATG GGCCTCGTCG G 




_1 oc 
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(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE ^PE: protein 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

Xaa His Ser Ala Ser Pro Lys Pro Ala Thr Ala Arg Arg Ser Glu 
^ 10 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
GCGGTBGCSG GYTTSGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



17 



(ii) MOLECULE TYPE: pepc 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 

Pro Cly .sp .,3 se. Ue Ser His .r, Se. P.e Met P.e CI. ci. .eu 



^° 15 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE* CHARACTERISTICS- 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Leu ASP Phe Gly Asn Ala Ala Thr Gly Cys Arg Leu Thr 
^ 10 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS:' single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: £EQ ID NO: 34: 
CGGCAATGCC GCCACCGGCG CGCGCC 

26 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
lA) LENGTH: 49 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



38.21(10660)A 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGACGGCTGC TTGCACCGTG AAGCATGCTT AAGCTTGGCG TAATCATGG 
(2) INFORMATION F©R SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 



49 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GGAAGACGCC CAGAATTCAC GGTGCAAGCA GCCGG 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-sice 
t3) LOCATION: 2 

O) OTHER INFORMATION: , r.oce= •Xaa at position 2 is Gly 
Ser, Thr. Cys, Tyr, Asn, Gin, Asp, or Glu" 

( ix ) FEATURE : 

(A) NAME/KEY: Modif ied-site 
.=) LOCATION: 4 

OTHER INFORMATION: r.ote= • vaa at oosition 4 is Ser 
or Thr" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Arg Xaa His Xaa Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /note= "Xaa at position 4 is Ser 
or Thr- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Gly Asp Lys Xaa 

1 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 4 

O) OTHER IMFORMATION: , note= "Xaa at position 4 is Ala, 
Arg, Asn, Asp, Cys , Gin, 31u. Gly, His, lie. Leu, 
Lys, Met, ?he. Pro, 3er, Thr, 7rp, Tyr, or Val" 



■139- 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



Ser Ala Gin Xaa Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /note= "Xaa at position 2 is Ala 
Arg, Asn, Asp, Cys, Gin, Glu, Gly, His, He, Leu 
Lys, Met, Phe, Pro, Ser, Thr, Trp, Tyr, or Val- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Asn Xaa Thr Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ixj FEATURE: 

(A) NAME/KEY: CDS 

(3) LOCATION: 1 . . 1287 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 



10 ,5 



Z ^ - ^ z s z S s 



25 30 



40 45 



- ==j 

55 60 

z z z - ^ J- - ,=^0 

'° 75 80 

Pro Ser lIu T"" f ^ ATT CGC CTG 

Pro Glu Ser Leu Leu Asp Val Gly Asn Ser Gly Thr Thr He Arg Leu 

85 90 95 

ATG CTC GGA ATA TTG GCG GGC CGT CCT TTT TAG AGC GCG GTA GCC GGA 
Met Leu Gly lie Leu Ala Gly Arg Pro Phe Tyr Ser Ala Val 111 tiy 
100 105 uo 

GAT GAG AGC ATT GCG AAA CGC CCA ATG AAG CGT GTG ACT GAG CCT -TG 
ASP Glu ser He Ala Lys Arg Pro Met Lys Arg Val Thr Glu Pro lIu 
115 120 125 

AAA AAA ATG GGG GCT AAA ATC GAC GGC AGA GCC GGC GGA GAG TTT ACA 
Lys Lys Met Gly Ala Lys He Asp Gly Arg Ala Gly Gly Glu Phi Jhr 

135 

CCG CTG TCA GTG AGC GGC GCT TCA TTA AAA GGA ATT GAT TAT GTA TCA 
Pro Leu Ser Val Ser Gly Ala Ser Leu Lys Gly He Asp Tyr vTl sir 

"0 155 

CCT GTT GC;< AGC GCG C.:^ ATT TCT GCT GTT TTG CTG GCC GGA T^A 

Pro val Ala Ser Ala Gin He Lys Ser Ala Val Leu Leu Ala Gly len 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



• 14.1- 
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CAC ACT GAG CGG ATG CTT TCr rr"r< T^mm 

His Thr Glu Arg Met Z sTr mI III "!T ^ r'^ 

195 Leu Ser Glu Asp 

-iuo 205 



CAA ACG AGT GTT T^C ATT rr-P 

Gin Thr ser Val Ifr gJv gJv rf ^""^ ^^-^ ^^-^ 

210 Leu Thr Ala Ala Asp 

220 



nl "I -I III GCG TTT TTC CTT GCT GCT 

225 ^ 230 ""^^ ^''^ Ala Ala 

240 

GGC GCG ATG GTT CCA AAr hrr no» 

V. £ - - - - - - - - 0.. ™ 

250 255 

A^' Pro ?hr Jhr G^' T 

260 Met Gly Ala 

265 270 

^ iz ne p.^^ iT^ ?f ^ r ^'^^ ™^ 

275 ^ P^o Tyr Gly 

280 285 

GAT TTG ATT ATA GAA ACG TCA TCT CTA AAG GCA GTT GAA ATC GGA GGA 
ASP Leu lie He Glu Thr Ser Ser Leu Lys Ala Val Glu lie Gly Gly 
" 295 300 

GAT ATC ATT CCG CGT TTA ATT GAT GAG ATC CCT ATC ATC GCG CTT CTT 
ASP lie lie Pro Arg Leu He Asp Glu He Pro He He Ala Leu III 

315 320 

f^"" J^"" ^^'^ ''^^ ATT .^AG GAC GCG GCA GAG CTA 

Ala Thr Gin Ala Glu Gly Thr Thr Val He Lys Asp Ala Ala Glu Leu 
225 330 335 

AAA GTG AAA GAA ACA AAC CGT ATT GAT ACT GTT GTT TCT GAG CTT CGC 1056 
Lys Val Lys Glu Thr Asn Arg He Asp Thr Val Val Ser Glu Leu Arg 



34 5 3 50 



.^G CTG GGT GCT GAA ATT GAA CCG ACA GCA GAT GGA ATG AAG GTT TAT 
Lys Leu Gly Ala Glu He Glu Fro Thr Ala Aso Gly Met Lys Val Tyr 
'55 360 365 



576 



624 



672 



720 



768 



816. 



864 



912 



960 



1008 



1104 
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Z III - TCC .CC CAC CO. CAT 

370 ""^^ ""^^ Val ser Ser His Gly Asp 

"'^ 380 



390 

-•^^ 400 



1152 



"I LI 'S?* z SI s - - -«o 

385 . He Thr Glu Glu 



S nl JI^ ^ ^ - Z S JII ?II IS I- S ?S 
-ni^ ^J^^JI I— - - " .3, 

425 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Arg Asp Lys Val Gin Thr Leu His Gly Glu He His lie Pro 



10 



15 

Gly ASP Lys ser lie Ser His Arg Ser Val Met Fhe Gly Ala Leu Ala 
^° 25 30 

Ala Gly Thr Thr Thr Val Lys Asn Phe Leu Pro Gly Ala Asp Cys Leu 
^ 40 45 

Ser Thr lie Asp Cys Phe Arg Lys Met Gly Val His He Glu Gin Ser 



55 



60 



ser ser Asp Val Val lie His Gly Lys Gly He Asp Ala Leu Lys Glu 

° -5 90 

Pro Glu Ser Leu Leu Asp Val Gly Asn Ser Gly Thr Thr He Arg Leu 
^5 . 90 . 95 

Met Leu Gly He Leu Ala Gly Arg Pro Phe Tyr Ser Ala Val Ala Gly 
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ASP Glu Ser lie Ala Lys Arg Pro Met Lys Arg Val Thr Glu Pro Leu 

120 

Lys Lys Met Gly Ala Lys lie Asp Gly Arg Ala Gly Gly Glu Phe Thr 

140 



Pro Leu Ser Val Ser Gly Ala Ser Leu Lys Gly He Asp Tyr Val Ser 
" 155 

Pro val Ala Ser Ala Gin He Lys Ser Ala Val Leu Leu Ala Gly Leu 

170 175 

Gin Ala Glu Gly Thr Thr Thr Val Thr Glu Pro His Lys Ser Arg Asp 

185 

His Thr Glu Arg Met Leu Ser Ala Phe Gly Val Lys Leu Ser Glu Asp 

200 205 

Gin Thr ser Val Ser He Ala Gly Gly Gin Lys Leu Thr Ala Ala Asp 

215 220 

lie Phe val Pro Gly Asp He Ser Ser Ala Ala Phe Phe Leu Ala Ala 

230 235 



240 

Gly Ala Met Val Pro Asn Ser Arg He Val Leu Lys Asn Val Gly Leu 
245 250 255 

Asn Pro Thr Arg Thr Gly He He Asp Val Leu Gin Asn Met Gly Ala 

265 . 270 

Lys Leu Glu He Lys Pro Ser Ala Asp Ser Gly Ala Glu Pro Tyr Gly 

280 285 

ASP Leu He He Glu Thr Ser Ser Leu Lys Ala Val Glu He Gly Gly 
290 295 300 

ASP He He Pro Arg Leu He Asp Glu He Pro He He Ala Leu Leu 

^1° 315 320 

Ala Thr Gin Ala Glu Gly Thr Thr Val He Lys Asp Ala Ala Glu Leu 
225 330 

Lys Val Lys Glu Thr Asn Arg He Asp Thr Val Val Ser Glu Leu Arg 

345 350 
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Lys Leu Gly Ala Glu He Glu Pro Thr Ala Asp Gly Met Lys Val Tyr 
355 360 

Gly Lys Gin Thr Leu Lys Gly Gly ^Ala Ala Val Ser Ser His Gly Asp 

375 380 ^ 

His Arg He Gly Met Met Leu Gly He Ala Ser Cys He Thr Glu Glu 



395 



400 



Pro lie Glu lie Glu His Thr Asp Ala lie His Val Ser Tyr Pro Thr 

4j^5 

Phe Phe Glu His Leu Asn Lys Leu Ser Lys Lys Ser 
420 425 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..1293 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTA AAT GAA CAA ATC ATT GAT ATT TCA GGT CCG TTA AAG GGC GAA 
Met val Asn Glu Gin He He Asp He Ser Glv Pro Leu Lvs Glv Glu 
^ 5 10 ^5 

ATA GAA GTG CCG GGC GAT AAG TCA ATG ACA CAC CGT GCA ATC ATG TTG 
He Glu Val Pro Gly Asp Lys Ser Met Thr His Arg Ala He Met Leu 
-0 :5 30 



GCG TCG CTA OCT GAA GGT GTA TCT ACT ATA TAT .^LAG CCA CTA C-""- GGC 
Ala Ser Leu Ala Glu Gly Val £er Thr lie Tvr lvs Pro Leu Leu G^y 
^5 40 45 



48 



96 



144 



-145 
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GAA GAT TGT CGT CGT ACG ATG GAC ATT TTr rr^ n.r^ 

Glu Asp Cys Arg Arg Thr Met Asn 111 III . '^'^ ^'^^ «AA 

50 „ Arg His Leu Gly Val Glu 

60 



^ 80 

z z ?s sii o^j^ z HI ?s ^ r - 

90 35 

ACA CGA TTA TTG GCA GOT TTP tt* ^ 
Thr Arg Leu Leu Ala Oil Zu IIJ ser G^; 

100 " f^^ Asn Glu Ser Val 

. 110 

TTG TCT GGC GAT GTT TCA ATT rrT a a* 

Leu Se. aiy Asp v.l sS^ ^ *™ '^'^ ™ 

115 ,Zi ^""^ Met Asp Arg Val Leu 

125 

- 5 ™ ^ z -I - tz ^li z z ^: 

■^-•^ 140 

TAT ACA CCA TTA ATT ATT AAG CCA TCT CTC at^ ... ^ 

Tyr Thr Pro Leu He He Lys Pro sTr vll r^t ^'"'^ 

145 ^ ^° Lys Gly He Asn Tyr 

160 

CAA ATG GAA GTT GCA Ar,'^ rr-a r-r. 

o.„ .... . - - - - ^ Jl! m - - - - 



175 



- ^ ^ =^ ?s 7. ?^ IT. z: z iT. 

185 

CGA AAT CAT ACT GAG ACG • ATG TTC AAA CAT TTT XAT ATT CCA ATT CAA 
Arg Asn Hxs Thr Glu Thr Met Phe Lys His Phe Asn Til p'o Til Tu 

205 

GCA GAA GGG TTA TCA ATT ^at >r-Tv ^ 

Ala Glu Gly .eu Ser ::I -i"" iff 

220 

CCT GCA GAT TTT C\T C--^ rr- . 
:.ys Pro Ala Aso Phe ^'^^ f ^ ^CA GCG TTC 

225 ' V:^ ^""^ -^^^ ^er Aia Ala Fhe 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



'20 



-30 

240 
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TTT ATT GTT GCA GCA CTT ATC Ara rrj. 

Phe lie Val Ala Ala lei lie Thr Pro cT f ^ 

ixe Thr Pro Gly ser Asp Val Thr He His 



250 255 



270 



280 285 



T^' aS III 7e IT, 7e ITn Jv" tT 

290 ^ lit ""^^ Met Leu Gin Pro He 

300 



325 

^•30 335 



768 



816 



AAA ATG GGC GGT AAT ATC CAA CTT T-rr ^^m ^, . 
..s OX, 0.. .s„ xae oTn IZ Z Z IT. =S 



912 



ACA ATC GAA GGA GAA TTA nTT nr-A a * 

Thr lie Glu Gly Gl^ leu vll Pro r7 ^""^ '^^^ 960 

305 tfn ^ -^^P I-s" Pro Val 

315 320 

11: s: |i ?s - en - ?s ?s - - -3. 



1056 



Ma 0=?^ ™ r r r 

^""^ Asp Thr Thr Ala 

345 

GAT ATG TTA AAC TTG TTA GGG TTT GAA TTA C.:^ CCA ACT AAT GAT CGA 
.sp Met Leu Asa Leu Leu Gly Phe Glu Leu Gin Pro ^Ir Sp g'S 

360 

TTG ATT ATT CAT CCG TCA GAA TTT AAA ACA AAT GCA ACA GAT ATT TTA 
Leu lie lie H.s Pro Ser Glu Phe Lys Thr Asn Ala Thr Asp lie lit 

ACT GAT CAT CGA ATA GGA ATG ATG CTT GCA GTT GCT TGT GTA CTT "^CA 
.hr Asp His. Arg He Gly Met Met Leu Ala Val Ala Cvs Val Leu Ser 

3?5 400 

AGC GAG CCT GTC .AAA ATC .^AA CAA TTT GAT GCT GTA AAT GTA TCA TTT ^248 
.er o.u Pro .al Lys lie Lys Gin Phe Asp Ala Val Asn Val Ser Phe 

410 415 

■:CA GGA TTT TTA CCA AAA CTA .AAG CTT TTA CAA AAT GAG GGA TAA .,93 
Pro Gly Phe ^eu Fro Lys Leu Lys Leu Leu Gin Asn- Glu Gly 
-20 425 430 



1104 



1152 



1200 
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(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44; 

Met Val Asn Glu Gin He He A<!r. c 

^ -Lxe lie Asp He Ser Gly Pro Leu Lys Gly Glu 

10 15 

lie Glu Val Pro Gly Asp Lys Ser Met Thr His Arg Ala He Met Leu 

30 

Ala Ser Leu Ala Glu Gly Val Ser Thr He .yr Lys Pro Leu Leu Gly 



40 45 



Glu ASP Cys Arg Arg Thr Met Asp He Phe Arg His Leu Gly Val Glu 

55 60 

lie Lys Glu Asp Asp Glu Lys Leu Val Val Thr Ser Pro Gly Tyr Gin 

° 75 80 

val Asn Thr Pro His Gin Val Leu Tyr Thr Gly Asn Ser Gly Thr Thr 



85 90 



Thr Arg Leu Leu Ala Gly Leu Leu Ser Gly Leu Gly Asn Glu 



100 



Ser Val 



105 110 
Leu ser Gly Asp Val Ser He Gly Lys Arg Pro Met Asp Arg Val Leu 



115 120 



125 



Arg Pro Leu Lys Leu Met Asp Ala Asn He Glu Gly He Glu Asp Asn 

135 140 

Tyr Thr Pro Leu He He Lys Pro Ser Val He Lys Glv He Asn Tvr 

-50 - 

Hn Met Glu Val Ala Ser Ala Gin Val Lys Ser Ala He Leu P^e Ala 
-65 170 

Ser Leu Phe Ser Lys Glu Pro Thr He He Lys Glu. Leu Asp val Ser 



130 



135 190 
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AS„ „U Thr Olu T.. Me. PHe ... p,, „^ 

zuo 205 
AU Glu Gly Leu s« He As„ Thr Thr P„ Olu Ala He A„ xle 

'^■'•^ 220 
LV3 Pro A.P Phe Hi. val P„ 01. He ser Ser Ala Ala Phe 

240 

^he lie V.1 AU Ala Leu He Thr P„ oly Se. Asp Val Thr He His 

A.n val Gl. He Ae„ Gl„ Thr Ar, Ser Gly He He Asp He Val Glu 

265 270 

Lys Met Gly Gly Asn He Gin Leu Phe Asn Gin Thr Thr Gly Ala Glu 

280 285 

Pro Thr Ala Ser He Arg lie Gin Tyr Thr Pro Met Leu Gin Pro II 



295 300 



Thr lie Glu Gly Glu Leu Val Pro Lys Ala He Asp Glu Leu Pro Val 



315 



320 



lie Ala Leu Leu Cys Thr Gin Ala Val Gly Thr Ser Thr He Lys Asp 
325 330 335 

Ala Glu Glu Leu Lys Val Lys Glu Thr Asn Arg He Asp Thr Thr ^la 

345 

Asp Met Leu Asn Leu Leu Gly Phe Glu Leu Gin Pro Thr Asn Asp Gly 
^55 360 365 

Leu He He His Pro Ser Glu Phe Lys Thr Asn Ala Thr Asp He L, 



370 



75 



eu 



;80 



Thr ASP His Arg He Gly Met Met Leu Ala Val Ala Cvs Val Leu ^er 

3?5 ' 400 



Ser Glu Pro Val Lys II 



e Lys Gin Phe Asp Ala Val Asn Val £er Phe 



■^^5 42.5 

Pro Gly Phe Leu Pro Lys Leu Lys Leu Leu Gin Asn Glu Glv 

425 430 
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(2) INFORMATION FOR SEQ ID NO:45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
GGAACATATG AAACGAGATA AGGTGCAG 

28 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
GGAATTCAAA CTTCAGGATC TTGAGATAGA AAATG 3 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: :5 base oairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION:' Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
GGGGCCATGG TAAATGAACA .^TCATTG 

28 



-150- 
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(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pair 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Other nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
GGGGGAGCTC ATTATCCCTC ATTTTGTAAA AGC 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



(A) DESCRIPTION: Synthetic DNA 



33 



:.eu Thr Asp Glu Thr Leu Val 



Tyr Pro ?he Lys Asp He Pro Ala Asp 

15 



Gin Gin Lys Val Val He Pro 
20 



Pro Gly Ser Lys Ser He Ser Asn Arg 
25 30 



Ala Leu He Leu Ala Ala Leu 

35 



Gly Glu 
40 



Gly Gin Cys Lys He Lys Asn 
45 



Leu Leu His Ser Asp Asd Thr 
50 ' 



Lys His Met Leu Thr Ala Val His Glu 

50 



Leu Lys Giy Ala Thr He Ser 
65 -0 



Trp Glu Asp Asn Gly Glu Thr Val Val 



Val Glu Gly His Gly Giy Ser Thr L 
= 5 * 



eu Ser Ala Cys Ala Asp Pro Leu 



.1 CI 
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Tyr Leu Gly Asn Ala Gly Thr Ala » 

100 ''^^ Thr Ser Leu Ala 

110 

Ala Leu Val Asn Ser Thr Ser S*»r ri„ r a^. ' , 

115 f^^ ""^^ Val Leu Thr Gly 

125 

Asn 

■^^^ 140 
AT, As„ at. Th. ne Olu ,Vr Leu A=n cxu OX. Ser Leu 

■■•^^ 160 
Pro lie Lvs Val Tyr Thr Ser v,l Phe C.s CI. ox. 

170 3^75 

Leu AX. .X, T.r VaX Ser Ser 0X„ ^r VeX Ser Ser ZXe Leu „e. Cs 

■^3= 190 
AX, Pro T.r AXa OXu OXu Pro V,X Thr Leu AX, Leu V,X 01. oX. L.3 

200 205 

Pro lie ser Lys Leu ^r Val Asp Met Thr He Lys Met Met Glu Lys 

^-^^ 220 

Phe Gly lie Asn Val Glu Thr Ser Thr Thr Glu Pro Tyr Thr Tyr Tyr 



240 



He Pro Lys Gly His Tyr lie Asn Pro Ser Glu Tyr Val He Glu Ser 
^"^^ -50 255 

ASP Ala ser Ser Ala Thr Tyr Pro Leu Ala Phe Ala Ala Met Thr Gly 
260 265 270 

Thr Thr Val Thr Val Pro Asn lie Gly Phe Glu Ser Leu Gin Gly Asp 
-^5 280 285 



Ala Arg Phe Ala Arg Asp Val Leu Lys Pro Met Gly Cys Lys He Thr 
-'0 295 300 



Gin Thr Ala Thr Ser Thr Thr Val Ser Gly Pro Pro Val Gly Thr Leu 

320 



305 310 



Lys Fro Leu Lys His Val Asp Met Glu =ro Met Thr Asp Ala Phe Leu 
■25 3 30 335 

Thr Ala Cys Val Val Ala Ala He Ser His Asp Ser Asp Pro Asn Ser 

345 350 
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Ala Asn Thr Thr Thr lie Glu Gly He Ala Acn ri . 

355 Arg Val Lys Glu 

365 

Cys Asn Arg He Leu Ala Met Ala Thr- ri„ r 

370 ^^"^ '-s" Ala Lys Phe Gly Val 

380 

^.s r.r Th. 01. .eu P„ ..p ol. XU 01„ v.l His 01. .eu .sn Ser 

400 

lie .ys ASP Leu Val Pre s.r Asp s« Ser oly Pro Val 01. val 
cys ™r avr ASP Asp His Ar, Val Ala Mat Sar Phe Ser Leu La„ Ala 

Gly Met val Asn Ser Gin Asn Glu Arg Asp Glu Val Ala Asn Pro Val 
Arg lie Leu Glu Arg His Cys Thr Gly Lys Thr Trp Pro Gly Trp Trp 



455 460 



ASP val Leu His Ser Glu Leu Gly Ala Lys Leu Asp Gly Ala Glu Pro 

4/u 



480 



INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

!ii) MOLECULE TY?E: protein 



(xi) SEQUENCE DESCRIPTION: iEQ ID NO: 50: 

Leu Ala ?ro Ser :ie Glu Val His Pro Gly Val Ala His Ser Ser Asn 

10 15 

Val He Cys Ala Pro Fro Gly Ser Lys Ser lie Ser Asn Ara Ala Leu 
20 25 30' 

Val Leu Ala Ala leu Gly Ser Gly Thr Cys Arg He Lys Asn Leu Leu 
■■- -iO 45 




•153- 



38-21(10660)A 



His Ser ASP ASP Thr slu Val Met Leu As„ M. Leu OZu Ar, L.u Oly 

= 60 ^ 

Al. Al, Thr .he ser Trp Olu =lu «lu .1, 

80 

=ly ^.s Cl, 01, 3„ ^^^^ 

Asn Ala 01, ,1, ser Ar, P.e Leu Thr T,. val AU Thr l AU 

110 

Asn ser Ser Thr v.l Asp Ser Ser Val Leu TKr 01, A.„ as„ Ar, Me. 

-^^^ 125 

L.S 01„ Ar, Pre He 01, Asp Leu Val Asp Ala Leu Thr Ala Ash Val 

140 

Leu Pro Leu Asn Thr Ser L,s 01, Ar, Al. Ser Leu Pro Leu L,s He 

160 

Ala Ala Ser Oly Gly Phe Ala Gly Gly Asn He Asn Leu Ala Ala Lys 



val Ser Ser Gin Tyr Val Ser Ser Leu Leu Met Cys Ala 



180 



185 



Lys Glu Pro Val Thr Leu Arg Leu Val Gly Gly Lys Pro 



195 



200 



175 



Pro Tyr Ala 
190 



He Ser Gin 



205 



Pro Tyr He Asp Met Thr Thr Ala Met Met Arg Ser Phe Glv He Asp 

215 220 



Val Gin Lys Ser Thr Thr Glu Glu His Thr Tyr His II 



225 



230 



235 



e Pro Gin Gly 
240 



Arg Tyr Val Asn Fro Ala Glu Tyr Val i: 



'*e Glu Ser Asp Ala Ser Cys 
-45 250 



255 



Ala Thr Tyr Pro Leu Ala Val Ale 



La Ala Val Thr Gly Thr Thr Cys Thr 
•^0 265 



270 



Val Pro Asn 11 



e Gly Ser Ala Ser Leu Gin Gly Aso Ala Arc Phe Ma 
-••5 230 



235 



Val Glu Val Leu Arg Pro Met Gly Cys Thr Vai Glu Gin Thr Glu 



:90 



195 



Thr 



300 



-154- 



38-21(10660)A 



Ser Thr Thr Val Thr Cl. p.o Ser .sp Oly lie Leu A., .xa Thr Ser 

320 

A., Oly xy. CI. ™. .s„ Asp A., Cvs V.l ... cy, PH. 

Thr Olv se. His „e. clu Ser cl„ T.. ... p„ Z v,l 

ser S,r CI. Ue Ma Asn CI„ Ar, Cys clu Cys A3„ Ar. xle L.. 

365 

AU Met L.. ASP Olu .eu Al, 01. v.l He eye Ar, Clu His 

-'^ 380 
Asp Asp Gly Leu Glu He Asn riw ;^ 

385 ^ Arg Ser Asn Leu Arg Gin 

400 

Pro val Gly Gly Val Phe Cys Tyr Asp Asp His Arg Val Ala Phe Ser 

410 

Phe ser Val Leu Ser Leu Val Thr Pro Gin Pro Thr Leu He Leu Glu 

425 

Lys Glu cys val Gly Lys Thr Trp Pro Gly Trp Trp Asp Thr Leu Arg 



440 

Gin Leu Phe Lys Val Lys Leu Glu Gly Lvs Glu Leu 

-155 

INFORMATION FOR SEQ ID NO: 51: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

ii) MOLECULE TYPE: protein 



ixi) SEQUENCE DESCRIPTION: SEQ ID N0:=1: 

lys Ala Ser Glu He Val Leu Gin Pro He Arg Glu He Ser Gly Leu 

. --3 ' 15 




38.21(10660)A 



lie Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg He Leu Leu Leu 

25 30 

Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu Leu Asn Ser 

40 45 

ASP ASP lie Asn Tyr Met Leu Asp Ala Leu Lys Lys Leu Gly Leu Asn 
i =5 60 

val Glu Arg Asp Ser Val Asn Asn Arg Ala Val Val Glu Gly Cys Gly 

° 75 go 

Gly lie Phe Pro Ala Ser Leu Asp Ser Lys Ser Asp He Glu Leu TVr 
35 =0 95 

Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr 



105 no 

Ala Ala Gly Gly Asn Ala Ser Tyr Val Leu Asp Gly Val Pro Arg Met 

^25 



Arg Glu Arg Pro He Gly Asp Leu Val Val Gly Leu Lys Gin Leu Gly 

135 

Ala ASP val Glu Cys Thr Leu Gly Thr Asn Cys Pro Pro Val Arg Val 

150 155 

Asn Ala Asn Gly Gly Leu Pro Gly Gly Lys Val Lvs Leu Ser Gly Ser 
-55 175 

He ser Ser Gin Tyr Leu Thr Ala Leu Leu Met Ala Ala Pro Leu Ala 

190 

Leu Gly Asp Val Glu He Glu He He Asp Lys Leu He Ser Val Pro 

205 



180 185 



195 200 



- . . - Met Glu Arg Phe Gly Val Ser Ala 

^'^ 215 220 



Tyr Val Glu Met Thr Leu Lys Leu 

215 

Glu H.s Ser Asp Ser Trp Asp Arg Phe ?he Val Lys Gly Gly Gin Lys 



-30 235 



>40 



Tyr Lys Ser Pro Gly Asn Ala Tyr Val Glu Gly Aso Ala Ser Ser Ala 

-•15 250 255 

Ser T-zr Phe Leu Ala Gly Ala Ala He Thr Gly Glu Thr Val Th- •'al 

265 270 



38.21(10660)A 



o:u Cl, c.. ox. ™, ^e. .eu 0I„ ox. ..p vaZ .ys PK, ca„ 
val Olu „e. ox, cys s„ Trp TJ. o! .3„ Se. v.X 

Thr V,l Thr 01, P„ Sej .3p AU Phe Oly Met „u ^, 

Ala val .sp val „e. .sn Met Pro Asp Val „a Met Thr Z 

AU val val Ma .eu PHe Ala Asp 01. P.o Th. xle A., ^sp val 

350 

Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met He Ala He Cys 

360 

Olu .eu Ar, .y. .eu 01. Ala Thr Val olu OXu 01. Ser Asp ^r Cs 

val II, Thr Pro Pro Ala L.s Val ..s Pro Ala OXu IXe Asp Thr T.r 

400 

ASP ASP His Ar, Met AXa Met Ala Phe Ser Leu Ala Ala Cs Ala Asp 

410 

val Pro val Thr XXe L.s Asp Pro 01. Cs Thr Ar, L.s Thr Phe Pro 



Thr 



425 



430 



Asp Tyr Phe Gin Val' Leu Glu Ser He Thr Lvs "^'s 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
■D) TOPOLOGY: linear 

'ii) MOLECULE TYPE: protein 



38-21(10660)A 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Lys Ala ser Glu lie Val Leu Cln Pro He Arg Clu He Ser Oly Leu 

10 

a. Lys Leu oly Ser Lys Ser Leu Ser As„ Ar, He Leu Leu Leu 
i 25 30 

Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu Leu Asn Ser 

40 45 

ASP ASP He Asn Tyr Met Leu Asp Ale Leu Lys Arg Leu oly Leu As„ 

55 60 

val Glu Thr Asp Ser Glu Asn Asn Arg Ala Val Val Glu Gly Cys Gly 
Gly He Phe Pro Ala Ser He Asp Ser Lys Ser Asp He Glu Leu Tyr 



90 95 



80 

a Ser He Asp Ser Lys Ser Asn n « m„ r 

85 

Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr 
100 105 

Ala Ala Gly Gly Asn Ala Ser Tyr Val Leu Asp Gly Val Pro Arg Met 

120 125 

Arg Glu Arg Pro He Gly Asp Leu Val Val Gly Leu Lys Gin Leu Gly 
"° 135 140 ^ 

Ala Asp Val Glu Cys Thr Leu Gly Thr Asn Cys Pro ?ro Val Arg Val 

-50 155 160 

Asn Ala Asn Gly Gly Leu Pro Gly Gly Lys Val Lys Leu Ser Gly Ser 
-65 170 1^75 • 

He Ser £er Gin Tyr Leu Thr Ala Leu Leu Met Ser Ala Pro Leu Ala 

185 190 

Leu Gly Asp Val Glu He Glu He Val Asp Lys Leu He Ser Val Pro 
"5 200 . 205 

Tyr Val Glu Met Thr Leu Lys Leu Met Glu Ara Phe Glv Val Ser Val 
-10 :i5 .20 

Glu His Ser Asp ^'er Trp Asp Arg Phe Phe Val Lys Glv Glv Gin Lys 

^35 240 



38-21(10660)A 



..3 s„ P„ 01, Ma ryr v.l cxu oi, .^p Ma Ser Ser Ma 

Cys ry. PH. .eu Ma 01. Ma Ma n, TKr OX, clu T.. .ax vaX 

270 

Clu Olv C.S ex. ™r Thr s„ .eu OXn =x. Asp VaX .ys PHe Ma OXu 

^ 285 
VaX Leu oxu lys ex. Cys Lys VaX Sar Trp Thr OXu Asn Ser VaX 

TKr VaX Thr OXy Pro P„ A., Asp AXa Phe CXy Met A„ His Leu A„ 

320 

Al» ne ASP VaX As„ „ec As„ Lys Mec Pro Asp VaX AXa Met Thr Leu 

335 

Ala VaX VaX Ma Leu Phe AXa Asp oly Pro Thr Thr IXe Ar, Asp VaX 

■^^5 350 
AXa ser Trp Ar, VaX Lys OXu Thr =Xu Ar, Met rle AXa IXe Cys Thr 

365 

OXu Leu Ar, Lys Leu OXy Ala Thr VaX OXu OXu oly Ser Asp Tyr Cys 

V.l IXe Thr Pro Pro Lys Lys Val Lys Thr AXa OXu IXe Asp Thr Tyr 

400 

ASP ASP K.s Arg Met Ala Met Ala Phe Ser Leu Ala Ala Cys Ala Asp 

410 

Val Pro lie Thr He Asn Asp Ser Gly Cys Thr Arg Lys Thr Phe Pro' 

425 430 

Asp Tyr Phe Gin Val Leu Glu Ara He Thr Lys H^s 
-^35 440 

{2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 44 amino acids 
iB) TYPE: ammo acid 
O) TOPOLOGY: linear 

fii) MOLECULE TYPE: protein 



• # 



38-21(10660)A 



(xil SEOUENCE DESCRIPTION; SEQ ID NOt53: 

Pro As„ Olu lie Val Leu cl„ P„ iie L.s Asp iXe Ser cly Thr 

val .eu Pro 01. ser ..s Ser .eu Ser Asn Ar, He .eu .eu .eu 
i 30 

Ala Ala Leu Ser Lys Gly Arg Thr Val Val Asp Asn Leu Leu Ser Ser 

40 45 

ASP ASP lie Hi. Tyr „ec Leu Gly Ale Leu Lye Thr Leu cly Leu His 

= = 50 

val Clu ASP ASP As„ Clu Asn Gl„ Ar, Ala lie Val Clu cly Cys Cly 

"^^ 80 
Gly Gin Phe Pro Val Gly Lys Lys Ser Glu Glu Glu He Gin Leu Phe 



90 



95 



Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr 



105 



110 



val Ala Gly Gly His Ser Arg Tyr Val Leu Asp Gly Val Pro Arg Met 

X 2 5 

Arg Glu Arg Pro He Gly Asp Leu Val Asp Gly Leu Lys Gin Leu Gly 



35 140 
— I'^Q '^^^ -^'"^ Pro Pro Val Arg He 



Ala Glu Val Asp Cys Phe Leu Gly Thr Asn Cvs Pro Pro Val 

155 160 



val Ser Lys Gly Gly Leu Pro Gly Gly Lys Val Lys Leu Ser Gly Ser 
^"5 :7o j,75 

lie Ser £er Gin Tyr Leu Thr Ala Leu Leu Met Ala Ala Pro Leu Ala 

135 190 



180 



Leu Gly Asp Val Glu lie Glu He He Asp Lys Leu He Ser Val Pro 

200 205 

Tyr Vai Glu Met Thr Leu Lys Leu Met Hu Arg Phe Gly Val Ser Val 

"5 220 

Glu His Thr Ser Ser Trp Asp Lys Phe leu Val Arg Gly Gly Gin Lys 

"0 22S 240 



-160- 



38-21(10660)A 



ryr ..3 se. P„ ^^^^^^ 

255 

- ... ^^^^ 

OIU 01. 01, s„ s« 0X„ 01. .sp V.1 ... Z 

val .eu OXuVs Met 01, .1 OX. v.l .H. .H. 

"^^^ 300 
Thr val Lys 01. p„ p.. ,3. ser Se. oly 

Al. val .sp val Asn Met As„ Met Pro .sp Val Ala „.t Thr 

335 



Ala Val Val Ala Leu Phe Ala Asp Gly Pro Thr Ala . . 

340 '^^^ Asp Val 

■^^5 350 

Ala ser Trp Ar, Val ... olu Thr Olu Ar, Met lie Ala He Cs Thr 

365 

Olu Leu Ar, L.S Leu 01. Ala Thr Val Val olu 01. Ser Asp T.r Cs 

lie lie Thr Pro Pro Olu L.s Leu As„ Val Thr olu lie Asp Thr T.r 

--^ 400 
ASP ASP H.s Arg Met Ala Met Ala Phe .er Leu Ala Ala Cys Ala Asp 



410 



415 



'al Pro val Thr He Lys Asp Pro Gly Cys Thr Arg Lys Thr Phe Pro 



425 



430 



Asn T,-r Phe Asp Val Leu Gin Gin Tyr Ser Vvs Hi^ 
^-^^ 440 

MFORMATION FOR SEQ ID NO: 54: 

-i) SEQUENCE CHARACTERISTICS: 

•A) LENGTH: 444 ammo acids 
'3) TYPE: amino acid 
O) TOPOLOGY: linear 

li) MOLECULE TYPE: protein 



• # 



38-21(10660)A 



(xil SEQUENCE DESCRIPTION: SEQ ID NO:54, 

I.ys ..o His Clu lU val .eu Xa, P„ Ue .ys Asp Ue Ser 01, Thr 

val ..s .eu ,.0 01, se. .,s Sa. .eu Se. As„ iie 1 

'^^ 30 

Al. Ala .eu Se. Olu ol, 

45 

Asp Asp He His Tvr Met r ah r^.r m r 

Glo ASP ASP Asn Clu Asn Gin a., a1, lie Val Olu Oly Cys Oly 

80 

=ly 01„ Phe P„ val aiy Lys .ys Ser Olu olu Olu lU Gin Leu Ph. 

90 95 

Leu Oly Asn Ala oly Thr Ala „e. Ar, P„ .eu Thr Ala Ala Val Thr 
val Ala Oly oly His Ser A„ Tyr Val Leu Asp oly Val Pro Ar, Met 

Arg Glu Arg Pro He Gly Asp Leu Val Asp Gly Leu Lys Gin Leu Gly 

Ala Glu val ASP cys Ser Leu Gly Thr Asn Cys Fro Fro Val Arg He 

155 160 

val ser Lys Gly Gly Leu Fro Gly Gly Lys Val Lys Leu Ser Gly Ser 

170 

lie Ser Ser Gin Tyr Leu Thr Ala Leu Leu Met Ala Ala Pro Leu Ala 



His 



.85 



190 



:eu Gly ASP Val Glu lie Glu lie He Asp Lys Leu He Ser Val Pro 

:00 205 

ryr val Glu Met Thr Leu Lvs Leu Met Glu Arg Phe Gly Val Phe Val 

- 1 5 



38.21(10660)A 



OXo s„ se. c.. 

240 

..3 s„ 0,, oxu o.. 

- PHe 01. 

'^''^ 270 
0.. c.. ... ^^^^^ 

v.l Leu Clu .,3 Me. oz. .1, val Th. r.r OXu .s„ Se. v,l 

300 

- v,x .,3 CI. P.O P„ .s„ ser se. OX. Met ..s „u .eu 

320 



lie ASP val .s„ Me. .s„ ..s Met P„ .3p v,l Met THr ... 

Ala V,l .u neu PHe Asp 01. P„ ™, ^ 

■^^^ 350 

AU .er v,l ..3 Olu T.. olu .r. Met lie AX, lie 0.3 Thr 

350 

Olu Leu A., ..3 .eu 01. .1. Thr v,l Val olu 01. Se. A3p Tvr c.s 



He lie Thr Pr 
385 



o Pro Glu Lys Leu Asn Val Thr Glu He Asp Thr Tyr 



395 



400 

ASP ASP „.s A„ „et Ala Met Al. PHe se. Leu Ala Ala C.3 Ala Asp 

415 

val P„ val Thr lie ..3 Asn Pro 01. c.3 Thr Ar, L.s Thr Phe Pro 

ASP Tyr Phe Glu Val Leu Gin Lys Tyr Ser Lys His 



38-21(10660)A 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55: 

Lys Pro Ser CIu iXe Val Leu Cln Pro lie L.s C.u He Ser 01. Thr 

Val L.s Leu Pro Gl. Ser L.s Ser Leu Ser Asn Ar. He Leu Leu Leu 

2^ 30 

Ala Ala Leu Ser Glu Gly Thr Thr v^i u^i a , 

3 5 y inr Thr Val Val Asp Asn Leu Leu Ser Ser 

40 45 

A.P A.P ne „U „e. oly ... 

^5 60 
V.1 Glu Olu ASP ser Ala Asn Gl„ Ar, Al. v,l v,l Glu Gly Cys OXy 

'^^ 80 
Gly Leu Phe P.O V.l Oly Lys Glu Ser Lys Glu Glu He Gl„ Leu Phe 

' J 95 
Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr 

val Ala Gly Gly Asn Ser Arg Tyr Val Leu Asp Gly Val Pro Arg Met 

Asp Leu Val Asp Gly Leu Lys Gin Leu Gly 

Ala Glu val ASP Cys Phe Leu Gly Thr Lys Cys Pro Pro Val Arg He 

--° -,60 

val £er Lys Gly Gly Leu Fro Gly Gly Lys Val Lys Leu Ser Gly ^er 

170 



-20 125 

Arg Glu Arg Pro lie Ser 




38-21(10660)A 



He Ser Ser Gin Tyr Leu Thr 



180 ""'^ "-^^ ""^^ t.eu Ala 

190 



Leu Gly Asp Val Glu He Glu lie He Asp Lys Leu II 



200 ^ ^^'^ Pro 

^00 205 



TV. V,l aiu M.. T.. 



220 

Glu His se. ser Ser Trp Asp Arg Phe Phe Val Arg Oly oly Gl„ 

^■'^ 240 
Tyr Lys ser Pro Oly Lys Ala Phe Val Glu Gly Asp Ala Ser Ser Ala 

^oO 255 
ser Tyr Phe Le. Ala Oly Ala Ala Val T.r Gly Gly Thr He Thr Val 

270 

Glu Gly Cys Gly Thr Asn Ser Leu Gin Gly Asp Val Lys Phe Ala Glu 

280 285 

val Leu Glu Lys Met Gly Ala Glu Val Thr Trp Thr Glu Asn Ser Val 

Thr val Lys Gly Pro Pro Arg Ser Ser Ser Gly Arg Lys His Leu Arg 

315 



Ala He Asp Val Asn Met Asn Lys Met ?ro Asp Val \1 

3 2 5 



320 

a Met Thr Leu 
335 



Ala Val Val Ala Leu Tyr Ala Asp Gly Pro Thr Al 

345 



340 



a rie Arg Asp Val 



Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met He Ala He Cys Thr 
^55 365 

Glu Leu Arg Lys Leu Gly Ala Thr Val Glu Glu Glv Pro Asp Tvr Cvs 
-'3 375 380 " 

He He Thr Pro Pro Glu Lys Leu Asn Val Thr Asp He Asp Thr Tyr 

390 395 4^0 

Asp ASP H.s Arg Met Ala Met Ala Phe £er Leu Ala Ala Cvs Ala Asp 
-05 



Val Fro Val Th 



r --e Asn Asp Fro Gly Cys Thr Arg Lys Thr Phe Pro 
•»20 425 430 



38.21(10660)A 



Asn Tyr Phe Asp Val Leu Gin Gin Tyr Ser Ly 



440 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 444 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



s His 



(ii) MOLECULE TYPE: prot 



em 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ala Gly Ala Glu Glu He Val Leu Gin Pro He Lys Glu He Ser Gly 

10 

Thr val Lys Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg He Leu Leu 

25 30 
Leu Ala Ala Leu Ser Glu Gly Thr Thr Val Val Asp Asn Leu Leu Asn 



40 



45 



Ser Glu Asp Val His Tyr Met Leu Gly Ala Leu Arg Thr Leu Gly Leu 
Ser Val Glu Ala Asp Lys Ala Ala Lys Arg Ala Val Val Val Gly Cys 
Gly Gly Lys Phe Pro Val Glu Asp. Ala Lys Glu Glu Val Gin 



35 90 



Leu Phe 
95 



Leu Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Val Thr 

105 no 

Ala Ala Gly Gly Asn Ala Thr Tyr Val Leu Asp Gly Val Pro Arg Met 
--5 120 125 

Arg Glu Arg Pro He Gly Asp Leu Val Vad Gly Leu Lys Gin Leu Gly 

135 140 

Ala Asp Vai Asp Cys Phe Leu Gly Thr Asp Cys Pro Pro Val Arg Val 
''^ -50 155 160 



38.21(10660)A 



AS„ al. U. 01. Gly ..u Pro GXy Gly ... val Ly. ser Cly Se. 

^"^^ 175 
He ser s„ cln Tyr Leu Ser Ala Leu Leu „et Ala Ala Pro Leu Pro 

Leu Gly Asp Val Glu He Glu lie He Asp Lys Leu He Ser He Pro 

^00 205 

Tyr val Glu Met Thr Leu Ar, Leu Met Glu Ar, Phe Gly Val Lys Ala 

220 

Glu His ser Asp Ser Trp Asp Arg Phe Tyr He Lys Gly Gly Gin Lys 

235 240 
Tyr Lys Ser Pro Lys Asn Ala Tyr Val Glu Gly Asp Ala Ser Ser Ala 

250 255 

Ser Tyr Phe Leu Ala Gly Ala Ala He Thr Gly Gly Thr Val Thr Val 

265 270 

Glu Gly Cys Gly Thr Thr Ser Leu Gin Gly Asp Val Lys Phe Ala Glu 

280 



285 



val Leu Glu Met Met Gly Ala Lys Val Thr Trp Thr Glu Thr Ser Val 

295 300 

Thr val Thr Gly Pro Pro Arg Glu Pro Phe Gly Arg Lys His Leu Lys 

'-^ —5 320 

Ala He Asp val Asn Met Asn Lys Met Pro Asp Val Ala Met Thr Leu 
325 330 

Ala Val val Ala Leu Phe Ala Asp Gly Pro Thr Ala He Arg Asp Val 
340 

Ala Ser Trp Arg Val Lys Glu Thr Glu Arg Met Val Ala He Arg Thr 

360 355 

Glu Leu Thr Lys Leu Gly Ala Ser Val C-lu Glu Gly Pro Asp Tyr Cys 
" 375 330 

335 'll'^ ^"^^ '•■^1 Thr Ala He Asp Thr Tyr 

•400 



:?o ^95 



ASP Asp H.s Arc Met Ala Met Ala Phe £er Leu Ala Ala Cvs Ala Glu 
^05 -10 



-167- 



38-21(10660)A 



Val Pro val Thr He Arg Asp Pro Gly Cys Thr Arg Lys Thr Phe Pro 



425 

Asp Tyr Phe Asp Val Leu Ser Thr Phe Val Lys Asn 
435 

2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



430 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 



Met Glu Ser Leu Thr Leu Gin Pro lie Ala Arg Val Asp Gly Ala He 
^ 10 15 

Asn Leu Pro Gly Ser Lys Ser Val Ser Asn Arg Ala Leu Leu Leu Ala 



20 25 



30 



Ala Leu Ala Cys Gly Lys Thr Val Leu Thr Asn Leu Leu Asp Ser Asp 

40 45 



ASP Val Arg His Met Leu Asn Ala Leu Ser Ala Leu Gly He Asn Tyr 

^- 60 



Thr Leu Ser Ala Asp Arg Thr Arg Cys Asp He Thr Gly Asn Gly Gly 

Pro Leu Arg Ala Pro Gly Ala Leu Glu Leu Phe Leu Gly Asn Ala Gly 
35 90 95 

Thr Ala Met Arg Fro Leu Ala Ala Ala Leu Cys Leu Gly Gin Asn Glu 
-00 105 no 

:ie Val Leu Thr Gly Glu Pro Arg Met Lys Glu Arc Pro lie Gly His 

--0 125 

Leu Val Asp £er Leu Arg Gin Gly Gly Ala Asn He Asp Tvr Leu Glu 

-35 140 



38.21(10660)A 



Gin Glu As„ Tyr Pro Pro Leu Ar, Leu Ar, Gly Oly Phe u, oiy 

160 

ASP lie Glu val Asp Gly Ser Val Ser Ser Gin Phe Leu Thr Ala Leu 

iOb -inn 

Leu Met Thr Al. Pro Leu Ala Pro Lys Asp Thr He He Arg v.l Lys 

185 

Gly Glu Leu V.1 ser Lys Pro Tyr He Asp He Thr Leu Asn Leu Met 

200 205 

Lys Thr Phe Gly Val Glu He Ala Asn His His Tyr Gin Gin Phe Val 

220 

val Lys Gly Gly Gin Gin Tyr His Ser Pro Gly Arg TVr Leu Val Glu 

235 240 
Gly Asp Ala Ser Ser Ala Ser Tyr Phe Leu Ala Ala Gly Ala He Lys 
245 250 255 

Gly Gly Thr Val Lys Val Thr Gly He Gly Arg Lys Ser Met Gin Gly 
260 265 



270 



ASP He Arg Phe Ala Asp Val Leu Glu Lys Met Gly Ala Thr He Thr 
275 280 285 

Trp Gly Asp Asp Phe He Ala Cys Thr Arg Gly Glu Leu His Ala He 

-95 2 00 

Asp Met: Asp Met Asn His He Pro Asp Ala Ala Met Thr He Ala Thr 

310 315 

Thr Ala Leu Phe Ala Lys Gly Thr Thr Thr Leu Arg Asn He Tyr Asn 

325 330 

Trp Arg Val Lys Glu Thr Asp Arg Leu ?he Ala Met Ala Thr Glu Leu 
^''O 345 550 

Arg Lys Val Gly Ala Glu Val Glu Glu Gly His Aso Tvr He Arg He 
-•=5 3 50 365 

Thr ?ro Fro Ala Lys Leu Gin His Ala Asp He Gly Thr Tvr Asn Asp 

'■^ ^^5 330 



His Arg Met Ala Met Cvs Phe £ 

^30 ~ 395 



" ''^^ "^'^ '^^ -^^^ ^'^^ '^sp Thr Fro 



• # 



38-21(10660)A 



(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



415 



Val Thr He Leu Asp Pro Lys Cys Thr Ala Lys Thr Phe Pro . 

405 Asp Tyr 

Phe Glu Gin Leu Ala Arg Met Ser Thr Pro Ala 

425 

(2) INFORMATION FOR SEQ ID NO: 58: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Glu ser Leu Thr Leu Gin Pro He Ala Arg Val Asp Gly Ala He 
5 10 15 

Asn Leu Pro Gly Ser Lys Ser Val Ser Asn Arg Ala Leu Leu Leu Ala 



25 30 



Ala Leu Ala Cys Gly Lys Thr Val Leu Thr Asn Leu Leu Asp Ser Asp 

40 -45 

Asp val Arg His Met Leu Asn Ala Leu Ser Ala leu Gly He Asn Tyr 

-5 50 

Thr Leu ser Ala Asp Arg Thr Arg Cys Asp lie Thr Gly Asn Gly Gly 

Pro Leu Arg Ala Ser Gly Thr Leu Glu Leu Phe Leu Gly Asn Ala Gly 

?0 95 

Thr Ala Met Arg Pro Leu Ala Ala Ala Leu Cys Leu Gly Gin Asn Glu 

1.05 nn 



He Val Leu Thr Gly Glu Pro Arg Met Lys Glu Ara Pro He Gly His 
--5 120 125 

• '^'^^ '^-y bly Axa Asn He Asp Tyr Leu Glu 

i35 



.170 

38-21(10660)A 



Oln Glu As„ ryr Pro P„ 

150 

*sp Glu va: ..p ox. ser se. .er Oln Phe L.u Th. Ala Leu 

170 

Leu .H. Ma Ma P.o =1. Asp r.r xle XXe A., val 

Oly OXu .eu val Ser P„ T.. xxe Asp xXe Thr .eu As„ .,u Met 

200 205 

Thj PHe Gl. VaX Glu XXe AXa Asn His „U Ty. .i„ p,, 

220 

Val Lys CXy Gly Gin Gl. Tyr His Ser P„ GXy Ar, Tyr Leu Val Glu 

240 

Gly ASP AXa ser Ser AXa Ser TVr Phe Leu AXa AXa GXy GXy XXe Lys 

250 255 
Gly Gly Thr VaX Lys Val Thr GXy Ue GXy Gly Lys Ser „e. GXn Gly 

270 

ASP lie Arg Phe Ala Asp Val Leu His Lys Met Gly Ala Thr He Thr 

280 285 

Trp Gly Asp Asp Phe He Ala Cys Thr Arg Gly Glu Leu His. Ala He 

300 

ASP Met ASP Met Asn His He Pro Asp Ala Ala Met Thr He Ala Thr 



310 



320 



Thr Ala Leu Phe Ala Lys Gly Thr Thr Thr Leu Arg Asn He Tyr Asn 
^25 330 335 

Trp Arg Val Lys Glu Thr Asp Arg Leu Phe Ala Met Ala Thr Glu Leu 
^'^^ 345 350 



^rg Lys Val Gly Ala Glu Val Glu Glu Gl 



•55 



y His Asp Tyr He Arg I] 



360 355 



e 



Thr P^o Pro Ala lys Leu Gin His Ala Asp He Gly Thr Tyr Asn Asp 



La _ 

230 



His Arg Met Ala Met Cys Phe Ser Leu Val A. 

395 400 



- Qc - "-^^ "<2u Ser Asp Thr Pro 

-"^ 290 



-171- 



38-21(10660)A 



Val Thr lie Leu Asp Pro L.s Cys Thr Ala Lys Thr Phe Pro Asp Tyr 

415 

Phe Glu Gin Leu Ala Arg Met Ser Thr Pro Ala 
420 . 

!) INFORMATION FOR SEQ ID NO: 59: 



(i) SEQUENCE Characteristics : 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Glu Ser Leu Thr Leu Gin Pro lie Ala Arg Val Asp Gly Thr Val 
5 10 15 

Asn Leu Pro Gly Ser Lys Ser Val Ser Asn Arg Ala Leu Leu 



20 25 



Leu Ala 
30 



Ala Leu Ala Arg Gly Thr Thr Val Leu Thr Asn Leu Leu Asp Ser Asp 

40 45 

Asp val Arg His Met Leu Asn Ala Leu Ser Ala Leu Gly Val His Tyr 

55 £0 

Val Leu Ser. Ser Asp Arg Thr Arg Cys Glu Val Thr Gly Thr Gly Gly 
" 75 80 

Pro Leu Gin Ala Gly Ser Ala Leu Glu Leu Phe Leu Gly Asn Ala Gly 
35 ?0 95 

Thr Ala Met Arg Pro Leu Ala Ala Ala Leu Cys Leu Gly Ser Asn Asp 

105 no 

He Val Leu Thr Gly Glu Pro Arg Met lys Glu Ara Pro lie Gly His 

120 125 



-eu val Asp Ala Leu Arg Gin Gly Gly Ala Gin lie Asp Tyr Leu Glu 

-^5 140 



■30 



-172- 

38-21(10660)A 



=ln CZ„ .3„ ryr P.o .eu 

" 160 
ASP v,l CI. v,l ASP 31, ser Val ser Ser Gin Ph, .eu 

Leu Met Ala Ser Pro Leu 



,7n Thr Ala Leu 

" 175 



,180 Val lie Ala He Lys 

190 

Olu V.X se. p„ 

"^^^ 205 

Lys Thr Phe Gly Val Glu v^i , 

210 "^'^ III Gin Ala Tyr Gin Arg Phe He 

■^■^^ 220 

va. ... OX, P„ ol. .3p 

«XV ASP s„ s« 

■^^^ 255 
Gly Gly Thr Val Lys Val THt- ri„ n ^-v 

260 A^'^ Ser Val Gin Gly 

270 

ASP lie Phs M, .sp V.1 .eu clu .,s Met Gl. .1. ™. v.l Thr 

^BO 285 

Trp Gl, Clu ASP ryr He .1. c.s Thr ..rg CI, Glu Leu As„ Ala He 

300 

Asp Met Asp Met Asn His T^^ ::v-^ ^ 

305 --° -^^P -^i^ Met Thr He Ala Thr 

320 

Ala Ala Leu Phe Ala Arg Gly Thr r ^ , 

3 25 "** " Asn 

-2° 335 

Trp Ar, v.l L,,s Glu T.r Asp Ar, Leu Phe Ala Met Al. Thr Glu Leu 



-"^^ 350 



Arg Lys Val Gly Ala Glu Val r--. ,-i 

■'55 '^"'■^ ■^-'■'^ -le Arg lie 

-=*^ 365 

Thr Fro ?ro Leu T'^r ' .^n r ■> « -u , i , . 

-0 ^'"^ "t:"! Gly Thr Tyr Asn Aso 



380 



His Arg Met Ala M 



---- ..w. .-ixci :':ec C'/s Pbe ^t^r r ,!! r 

:.85 ;oo ? 

400 



■^'^3- 38-21(10660)A 



Val Thr lie Leu Asp Pro Lys Cys Thr Ala Lys Thr Phe Pro Asp Tyr 

410 

Phe Gly Gin Leu Ala Arg He Ser Thr Leu Ala 
420 425 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Met Leu Glu Ser Leu Thr Leu His Pro He Ala Leu He Asn Gly Thr 

1 5 . 10 15 

Val Asn Leu Pro Gly Ser Lys Ser Val Ser Asn Arg Ala Leu Leu Leu 
20 25 30 

Ala Ala Leu Ala Glu Gly Thr Thr Gin Leu Asn Asn Leu Leu Asp Ser 
35 40 45 

Asp Asp He Arg His Met Leu Asn Ala leu Gin Ala Leu Gly Val Lys 

55 50 

Tyr Arg Leu Ser Ala Asp Arg Thr Arg Cys Glu Val Asp Gly Leu Gly 
^'5 70 75 80 

Gly Lys Leu Val Ala Glu Gin Pro Leu Glu Leu Phe Leu Gly Asn Ala 
55 ?o 95 

Gly Thr Ala Met Arg Pro Leu Ala Ala Ala Leu Cvs Leu Gly Lys Asn 
-00 105 110 

Asp He Val Leu Thr Gly Glu Pro Ara Met Lys Glu Arg Pro He Gly 

i:0 125 

His Leu Vai Asp Ala Leu Arg Gir. Gly C-lv Ala Gin He Aso Tvr Leu 
HO 135 j_4o 



38-21(10660)A 



Olu Cln Olu Ar, Ar, c.s He Ala Sly Gly Ar, sly Oly 

160 

Lys Leu Thr v,l ASP Oly ser v,l ser Ser Oln Ph. Leu Thr AU Leu 



160 

Gin Phe Leu Thr Ala 

170 

Leu Met Thr Ala Pro Leu Ala rin r-i^ a 

ICQ lie Gin He Gin 

185 

Gly Glu Leu Val Ser Lvs Prn t^,^ ti « 

195 "^^^ lie Asp He Thr Leu His Leu Met 

^00 205 

Lys Al, Phe Oly val ASP vel Vel His olu Asn .:vr 01„ lie Phe His 

220 

lie Lys Oly Oly ol„ Thr Tyr A., Ser Pro Oly He Tyr Leu Val olu 

240 

Gly ASP Ala ser Ser Ala Ser Tyr Phe Leu Ala Ala Ala Ala lie Lys 

255 

Gly Gly Thr Val Arg Val Thr Gly He Gly Lys Gin Ser Val Gin Gly 

270 

ASP Thr Lys Phe Ala Asp Val Leu olu Lys Met oly Ala Lys Ue Ser 

280 285 

Trp Gly Asp Asp Tyr He Glu Cys Ser Arg Gly Glu Leu Gin Gly He 



295 220 



ASP Me. ASP „et Asn „.s He Pro Asp Ala Ala :.et T.-.r He Ala Thr 



315 



320 



Thr Ala Leu Phe Ala Asp Gly Pro Thr Val He Arg Asn He Tyr Asn 

330 

Trp Arg Val Lys Glu Thr Asp Arg Leu Ser Ala Met Ala Thr Glu Leu 



345 



350 



Arg Lys Val Gly Ala Glu Val Glu Glu Gly Gin Asp Tyr He Arg Val 



Val Fro ?rc Ala Gin Leu lie Ala Al^ -,u ^ 

370 '^■^^ -'^ ^'-y "^2: Tyr Asn Asp 



365 

La g: 



250 



His Arg Met Ala Me 



t Cys Phe 



2 85 :Qn ' ^"^^ *''^^ ^'^'^ '^^^ 



390 



395 



400 



• 17'?. 

38-21(10660)A 



Val T.. xxe .eu .sp P.o ..s C.s ... .,3 TK. PHe P.o .sp 



. 415 



Phe Glu Gin Leu Ala Arg Leu Ser Gin He Ala 

425 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 432 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Met Glu Lys lie Thr Leu Ala Pro He Ser Ala Val Glu Gly Thr He 

10 

As„ L.„ P.O ojy ser .ys Ser Leu Ser As„ Ar, Ala Leu Leu Leu Al. 

25 30 

Al» Leu Ala Lys cly Thr Thr Lys Val Thr Asn Leu Leu Asp Ser Asp 

40 45 

ASP lie Arg His Met Leu Asn Ala Leu lys Ala Leu Gly Val Arg Tyr 

= 5 50 

Gin Leu Ser Asp Asp Lys Thr He Cys Glu He Glu Gly Leu Gly Gly 

' 75 on 



80 

Ala Phe Asn He Gin Asp Asn Leu Ser Leu Phe Leu Gly Asn Ala Gly 

?0 05 



Thr Ala Met Arg Pro Leu Thr ^'a si» - 

100 ' --■^ -^s" "is 



.05 



.10 



Glu Vai Glu He He Leu ^h*- r.-v r',, -v^ ^ 

..- ^--V '-^o Arg Met Lys Glu Arg Pro 

--3 HS 

He Leu His Leu Vai Aso Ala Leu Ar-a aI;, r'„ ^ 

•30 • '^'■^ ^-^y '^ia Asp ..e Arg 

-^^ 140 



38.21(10660)A 



Tvr teu CXo As„ Clu 01. ryr Pro Pro Leu Al, XXe A., As„ 01. 

160 

ne .y. Olv 01, 

170 j^75 

Leu Thr Ala Leu Leu Met Ser Ala Pr-r. ro,. ai o-. 

iL80 ^^"^ Asn Asp Thr Glu 

190 

lie Olu lie ne 01. olo .eu Vel Ser ..s Pro Ue Asp He Thr 

^00 205 

.e„ Al. Met Met Ar, Asp Phe 01. v.l L.s Val Olu Aen His His ryr 

^•^^ 220 
=ln L.s Phe 01„ val L.s oly As„ Oln Ser Tyr He Ser Pro Asn Lys 

240 

Tyr Leu v,l Olu oly Asp Ala Ser Ser Ala Ser Tyr Phe Leu Ala Ala 

250 255 
Gly Ala lie Lys Gly Lys Val Lys Val Thr Gly He Gly Lys Asn Ser 

265 270 
He Gin Gly Asp Arg Leu Phe Ala Asp Val Leu Glu Lys Met Gly Ala 



Lys lie Thr Trp Gly Glu A 



lu. :rp ^xy Glu Asp Fhe He Gin Ala Glu Kis Ala Glu Leu 



;oo 



Asn Gly lie Asp Met Asp Met Asn His He Pro Asp Ala Ala Met Thr 

315 320 

lie Ala Thr Thr Ala Leu Phe £er Asn 



225 330 



Gly Glu Thr Val He Arg Asn 

335 

He Tyr Asn Trp Arg Val Lys Glu Thr Asp Arg Leu Thr Ala Met Ala 
-'^0 345 350 

Thr Glu Leu Arg Lys Val Gly Ala Glu Val Glu Glu Gly Glu Asp Phe 
-55 3 60 365 

He Ar^ He Gin Fro Leu Ala Leu Asn Gin Phe Lys His Ala Asn He 
' " -~5 330 



Glu Thr Tyr Asn Asd Hi 

-85 390 395 



s Arg Met Ala Met Cys Phe Ser Leu He Ala 

400 



38.21(10660)A 



Leu Ser Asn Thr Pro Val Thr He Leu Asp Pro Lys Cys Thr Ala Lys 
405 410 

Thr Phe Pro Thr Phe Phe Asn Glu Phe Glu Lys He Cys Leu Lys Asn 

425 



430 



(2) INFORMATION F^R SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Val He Lys Asp Ala Thr Ala He Thr Leu Asn Pro He Ser Tyr He 
15 10 15 

Glu Gly Glu Val Arg Leu Pro Gly Ser Lys Ser Leu Ser Asn Arg Ala 
20 25 30 

Leu Leu Leu Ser Ala Leu Ala Lys Gly Lys Thr Thr Leu Thr Asn Leu 

35 40 45 

Leu Asp Ser Asp Asp Val Arg His Met Leu Asn Ala Leu Lvs Glu Leu 
-'^ 55 60 

Gly Val Thr Tyr Gin Leu Ser Glu Asp Lys Ser Val Cys Glu He Glu 
" '0 75 80 

Gly Leu Gly Arg Ala Phe Glu Trp Gin Ser Gly Leu Ala Leu Phe Leu 
35 90 95 

Gly Asn Ala Gly Thr Ala Met Arg Pro Leu Thr Ala Ala Leu Cys Leu 
iOO loS 110 

Ser r:-r ?ro Asn Arg Glu Gly Lys Asn Glu He Val Leu Thr Gly Glu 
-5 120 125 

Pro Arg Met Lys Glu Arg Pro He Gin His Leu Val Aso Ala Leu Cys 
-^0 135 140 



"^^^ 38-21(10660)A 

Oln AU Oly Ma Olu lie 01„ ryr Leu CI. cl„ cXu ol. p,, p„ 

160 

lie Ala lie Arg Asn Thr Gly Leu Lys Gly Gly Arg He Gin He Asp 

170 

Oly ser Val s« Ser sl„ Phe Leu Th. Ala Leu Leu Met Ala Ala P„ 

i 190 
Met Ala Glu Ala Asp Thr Glu He Glu He He Gly Glu Leu Val Ser 

200 205 

Lys Pro ryr He Asp He Thr Leu Lys Met Met =1„ Thr Phe Gly v.l 

220 

Glu val Glu Asn Gin Ala TVr Gin Arg Phe Leu Val Lys Gly His Gin 

240 



Gin Tyr Gin Ser Pro His Arg Phe Leu Val Glu Gly Asp Ala 



245 250 



Ser Ser 
255 



Ala ser Tyr Phe Leu Ala Ala Ala Ala He Lys Gly Lys Val Lys Val 

265 270 

Thr Gly Val Gly Lys Asn Ser He Gin Gly Asp Arg Leu Phe Ala Asp 
275 280 285 

val Leu Glu Lys Met Gly Ala His He Thr Trp Gly Asp Asp Phe He 

290 295 300 

Gin Val Glu Lys Gly Asn Leu Lys Gly He Asp Met Asd Met Asn His 

310 

He Pro Asp Ala Ala Met Thr He Ala Thr Thr Ala Leu Phe Ala Glu 
225 330 335 ■ 

Gly Glu Thr Val He Arg Asn He Tyr Asn Trp Arc Val Lys Glu Thr 

345 350 

Asp Arg Leu Thr Ala Met Ala Thr Glu Leu Ara Lvs Val Gly Ala Glu 

360 . 355 

val Glu Glu Gly Glu Asp Phe He Arg He Gin Pro Leu Asn Leu Ala 

375 380 

Gin Phe 0-ln His Ala Glu Leu Asn He His Asp His Ara Met Ala Met 

-95 *:an -Qc 

--^ -95 400 



38-21(10660)A 



cys Phe Ala L.U ne AU Leu Ser Lys Thr Ser Val Thr Ue Ceu Asp 

410 4j^5 

Pro ser Cys Thr Ala Lys Thr Phe Pro Thr Phe Leu He Leu Phe Thr 



430 



Leu Asn Thr Arg Glu Val Ala Tyr Arg 
435 

(2) INFORMATION rt)R SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
iD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Asn Ser Leu Arg Leu Glu Pro He Ser Arg Val Ala Gly Glu Val Asn 



10 



15 



Leu Pro Gly Ser Lys Ser Val Ser Asn Arg Ala Leu Leu Leu Ala Ala 
2° 25 30 

Leu Ala Arg Gly Thr Thr Arg Leu Thr Asn Leu Leu Asp Ser Asp Asp 

40 ,5 

lie Arc Hxs Met Leu Ala Ala Leu Thr Gin Leu Gly Val Lys Tyr Lys 

55 60 

Leu ser Ala Asp Lys Thr Glu Cys Thr Val His Gly Leu Gly Arg Ser 



75 



80 



?he Ala Val Ser Ala Pro Val Asn Leu Phe Leu Gly Asn Ala Gly Thr 



35 



?Q 



95 



Ala Met Arg Pro Leu Cys Ala Ala Leu Cys Leu Glv Ser Glv Glu -yr 

105 ii'o 



Met Leu Gly Gly Glu Pro Arg Met Gl 



u Glu Arg Pro He Gly His Leu 



125 



Val Asp Cys Leu Ala Leu Lys Gly Ala His He Gin 



— ^ 135 



Tyr Leu Lys Lvs 
140 



■180- 



38-2l(l0660)A 



- - - - 

Asp Val His Val Asd ru, e 

Asp Gly ser Val Ser Ser r^r. nv, 

no Phe 



165 



175 



Leu Met Ala Ala Pro Ala Mo^ 

ifin Pro Val He p^^ « 

185 ^""^ lie His He 

190 

Lys Gly Glu Leu Val S*.r r 

ser 

205 V 



Met Asn Ser Ser Gly Val Val 

210 Hi 



215 



s Asp Asn Tyr Lys Leu Phe 

- - ^V. 3. ^^^^ 

- - - iz - - 3. 



255 



- - - V. ... ^^^^ 

- V. 0. 

285 



Gly Asp Asp Phe He Glu Ala Glu 



:9o 



295 



Gin Gly Pro Leu His Gly Val Asp 
300 ^ 



Met ASD Met A^n r' 

305 ■ " Asp val Gly His Aso Hx 

ser H.s Cys Leu Pro Ar. Val Pro Pro H.s Ser Gin Hi 



Ala val Arg Asp Asp Arg Cys Thr Pro Cys Th 



LS Ser Gly Gin 
320 

s Leu Gin Leu 
335 



i40 



345 



Thr His Gly .His Arg 



Ala Gin Ala Gly vkl Ser Glu Glu Gly Thr Thr Ph 



350 



Arg 



Ala Ala Asp Pr 



160 



he He Thr Arg Asp 
255 



° ''^^ -^'^ Arg His Leu Gin Arg Ser 

-30 

Arg He Aia Met Cvs Phe -'er ' p,, v;.i - 

335 ■ -'^ -eu Ser Asp He Ala Val 

400 




38-2l(l0660)A 

Thr He Asn Asp Pro Gly cvs rh^ c 

"5 " To ™' 'Vr P., 

se. V,, se. oi„ 

425 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 442 am-; „ • 

(B) TYPE, in, "° ^'^^^^ 
\oi IX eE: ammo acid 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO,64. 

S. c.. .eu ^^^^ 

01. Olu v. P., CI. s„ . „e s„ ^ 

30 

Leu Leu Ala Ala Leu Ala Glu Gly Ser -h. r^ r^ 

3 5 "l;^ '^"^ He Thr Gly Leu Leu 

45 

Asp Ser Asp Asp Thr Ara Val Mec 'eu V. - 

;0 - '-^^ -^-^ Ala Leu Arg Gin Leu Gly 

60 

V.1 se. V.I civ Clu Val .1, ,1, „^ 

Ala .r, Phe P„ ... '° 

' 95 

Thr Ala Phe Arg Pro Leu T^-r- m 

100 Gly Asp 

-^^^ 110 

Tyr Arg Leu Ser Cly Val Pro Arg Met H.s Glu Arg Pro lie Gly Aso 

"■^ " 125 



Leu Val Asp Ala Leu 

135 



130 '^t': '^"^ -"^^^ ---^ Glu Tyr Leu Gly 

140 



I 



-182- 

38.21(10660)A 

- - .0 

- - - 

175 

- - - - V. ^ 

- - - - „^ 

- - - 

Gly Trp Arg Ala Phe Thr Tlo , 

™S "-^^ "^-^ ''-1 Tyr A„ oly ,ro 

^ 240 
" - 0.. 3„ ... 3„ ... 

=1. J. ne 0.. c., p 

270 



^sp s.. ... 

i 285 
Met Gly Ala Asp Val Arrr t„»- r-i r, 

290 Trp lie gi^ Thr Arg Gly 



300 



-1 v., ci„ .J, ,^.3 

320 



Asn .e. lie 

Ala Asp Gly Pro 



335 



340 Trp Arg Val Lys 

Olu T.r ASP Arg :ie H.s Ala Mec His r.r Glu Leu Glu Lys Leu Gly 

365 

■^^^ '^-y '■''31 Gin -^er C'x, ^ ™ 

-0 -^'^ ^-^^ -^'-^ ^^'-^ ^'-^l Ala Pro Pro Glu 

cl, aiv ^.p ..p . 

--^ 400 




-183- 

38-2l(l0660)A 

- - - - - ... 

Val TV. .1 430 
^al Tyr Ala Gly Leu Leu Ala Ai. . 

435 '^■^^ Ala Arg Asp 

440 

(2) INFORMATION FOR SEQlDNO:65: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 427 amino a^ids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



protein 



(xi) SEQUENCE DESCRIPTION 



SEQ ID NO: 65: 

se. ... „^ ^^^^ ^^^^ 

.e. P.O o.v se. ..3 Se. v.. 3„ 



■5c Ala 

30 

U. OX. .V. VjX ... ^ 3„ ^ 

-p v.i ^.^^ 

Thr s.. Ma .sp ™. ^^^^^ 

' ^ 80 

Pro Leu Arg Ala £er Gly Thr Leu Glu Leu Phe Leu Gl, 



5 5 7^'-^ ^•'^•e -eu Gly Asn Ala Gly 



?0 35 



Thr Ala Mec Ara P'-o "L^u ai^ ii= m 

y ^-o Leu Ala Ala Ala Leu Cys leu Gl 



. — nxa ^eu cys .eu Gly Gin Asn Glu 

-■-•5 110 



:ie Val leu Thr Gly .31u ?ro Arc Met Lvs Glu Ar 



120 



? He Gly H 

.25 



IS 



-184- 



38-2l(10660)A 



- -J s.. c.„ o:. ox. ... ^ 

140 

Gin Glu Asn Tyr Pro Pr-o r ^ » 

Pre P„ Leu Ar, .e. Arg GXpiy Phe He 01, 

Asp He Glu Val Asp Gly Ser Val o 

^^P y ser val Ser Ser Gin Phe Leu Thr Ala Leu 

175 

Leu Met Thr Ala Pro Leu Ala Pro Glu Asp Thr II. 

180 lie He Arg Val Lys 

" 190 

Gly Glu Leu Val Ser Lys Pro Tvr He 

100 



He Asp He Thr Leu Asn Leu Met 



205 



Lys Thr Phe Gly Val Gin ti<» »t , 

210 Phe val 

'^■^^ 220 

-X .vs ozv 01. al„ oi„ ryr HI. se. P„ 01. 

"^^^ 240 



230 



Oly ASP ser Ser Ser Tyr Phe Leu Al. ol. 01. il 

250 25 
01. 01. Thr v,l ... V,, 



e Lys 
255 



265 

Asp He Arg Phe Ala Asp Val Leu His Lv 

-'■^ 230 



270 

s Met Gly Ala Thr He Thr 
285 



Trp Gly Asp Asp Phe He Ala Cys Jhr Arg Gly Glu Leu Hxs Ala He 



295 



)00 



Asp Met Asp Met Asn His He Pro Asp Al 



305 



310 



P Ala Ala Met Thr He Ala Thr 



315 



320 



rhr Ala Leu Phe Ala Lys Gly Thr Thr Thr Leu Arg Asn He Ty 
-^^ 330 



•rp Arg Val Lv 



340 

Arg Lys Val Gly Al 
: 5 5 



ys Thr Asp Arg Leu Phe Al 



345 



a Glu Val Glu Glu Gly His Asp Tvr 
3 50 



r Asn 
335 

a Met Ala Thr Glu Leu 
350 

He Arg He 



Thr Pro Pro Ala Lv 
3 70 



ys Leu Gin His Ala Asd 11 



e Gly Thr Tyr Asn Asp 

-'^ 390 




■185- 

38.21(10660)A 



His Arg Met Ala Met Cv« r>u ^ 

385 Phe Ser Leu Val Al;, . 

Ser Asp Thr Pro 

Thr lie Leu Asp P.o Lys Cys Thr AI r 

Cys Thr Ala Lys Thr Phe Pro Asp Tyr 

Phe Glu Gin Leu Ala Ar« ^ 

Ala Arg Met Ser Thr Pro Ala 

425 

(2) INFORMATION FOR SEQ idnO:66: 

(i) SEQUENCE CHARACTERISTICS- 

A LENGTH: 1894 hase'p'airs 
: TYPE: nucleic acid 
C STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE- HMa / 

^ift. DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 275.. 1618 

(XX) SEQUENCE DESCRIPTION: SEQ IDMO:66: 

ACGGGCTGTA ACGGTAGTAG GGGTCCCGAr r^r^... 

GGTCCCGAG CACAAAAGCG GTGCCGGCAA GC.^^GAACTAA 

..TCCATGGG CAATAATCCT ..TTTCATTGG .TTGGCCTCT OGTCTGGCAA .GGTTGCTAG 
^CAAATTA ACAAACTGTC GCCCTTCCAC TGACCATGGT .:UCGATGTTT 

TTTACTTCCT TGACTAACCG .:^GGAAAATT'" rrrnnr^r^r- 

OGAAAATT. GGCGGGGGGC AGAAATGCCA ATACAATTT'A ' 

GCTTGGTCTT CCCTGCCCCT ' V^TT-^G-rrr ----^ 

•WT.GxCCC CTCC ATG GCC TTG CTT TCC CTC 

Met Ala Leu Leu Ser Leu 

~ 5 

AAC AAT CAT C.\A TCC Cxr r^i r^r^r^ ^ 

As„ s^^ H?! J «T CCC CC. OCC 

10 f. '-^^ Asn Pro Pro Ala Gin 

20 

GGG GTC GCT TTG ACT GGC — r-> • ^ 

Gly Val Ala Leu r-.r C'v Ir^ f ""^^ ^^T AAA TCC .ATT 

L.y Arg Leu Arg Val ?ro Gly Asp Lys Ser zVe 



60 
120 
180 
240 
292 

340 

398 




-186 
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TCC CAT CGG GCC TTG ATG TTG rrr n.r. 

- H. ^ - =00 „c .CC .CC 

50 

ATC GAA GGG CTA CTG TTr n^o 

lie OXU Cl, .e„ S= Z Z Z L'o T 
" 60 ^ Thr Ala His Cys 

- .^.^ :s s,^ s o=?i - - - - - CM Z 

lie ser Glu Leu Asn Ser Glu Lys He 
ATC GTT CAG GGT CGG GGT CTG rr^ n.r, 

He Val Gin Gly Arg Gly Zl G^y G^n Zl C^^ ^^C GTT 

90 Glu Pro Ser Thr Val 

100 

TTG GAT GCG GGG AAC TCT rrr ^r^r^ , 

Leu ASP Ala Gly Asn Se" G^y Thr Me't 

105 ^ f!^^ Met Arg Leu Met Leu Gly Leu 

CTA GCC GGG CAA AAA GAT TGT TTA TTC Arr r^n 

- =1. Cl„ ... ™ -c .CC C.C 0„ 0„ .cc 

130 

lIu Arg' S^J A^g^ M^'^ A^^ ft tf ^AA ATG 

135 III Val He Gin Pro Leu Gin Gin Met 

'ny A?a ^ tll f C GGC ...G TTT GCG CCG CTG GC^ 

Trp Ala Arg Ser Asn Gly Ivs ?he Ala Pro Leu Ala 

-^^ 165 

GTC CAG GGT AGC CAA tta a ax 

0. s„ Ut - - I- - - - - 

-''^ 180 
TCA GCC CAG GTA AAG TCC TGC C-G T-r r-^ 

ser Ala Gin Val Lys Ser Cvs ^^n 1 ? "^"^ "^^^ GAG 
185 ^ -^^^ Gly Leu Thr Thr Glu 

195 

GGG GAC ACC ACG GTT ACA Pia nr~^ ^^,r, 

Gly ASD Thr Thr Val f ^ ^AT AGC GAA 

200 -'^^ Arg Asp His Ser Glu 



205 



S M^t leu oln Sa P^e GW Ala 

:i5 ^^on "^^'^ ^^'-^ Asp Pro Val Thr 

230 



436 



484 



532 



580 



628 



676 



724 



772 



820 



S68 



?16 



?64 




187- 
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CAT AGC GTC ACT mr r-xm 

240 " Val 

GTG GTG CCA GGG GAC ATr 

- - - 4 S S S S S .™ SI? - - 

255 " Ala Ala 

- ne : J - - S ™ ™ - c„ 1° ^ 

270 lie Aan 

CCC ACC AGG ACA GGG GTG TTG CAi 

- - ™^ 0.. V,, ™ ™ -0 c« 

285 '^^^ Met Gly Ala Asd 

290 ^ 

ATT ACC CCG GAG AAT ria r^o. 

Ill - -I - - - ™ JC= , 

3 00 Glu Pro Val Ala Asp 

CTG CGG GTT AGG GCA AGC CAT CTC r.n 

^eu Ar. val Arg Ala Ser Hil L,' G^n Gly' c'' <^AA 

Gin Gly Cys Thr Phe Gly Gly Glu 

325 

ATT ATT CCC CGA CTG ATT pa^ 

lie Pro Ar. .eu' ^ -C 

330 3^3'^ lie Leu Ala Val Ala Ala 

GCC TTT GCA GAG GGC ACT ACC CCr ^^^^ 
Ala Phe Ala Clu Gly ... GCC GCA GAA CTG AGG 

Glu ^sp Ala Ala Glu Leu Arg 

GTT AAA GAA AGC GAT CGC CTG GCr r^n ' 

val Lys Glu Ser Asp Arg Su Ala A^a lH t? ^ ^ 
3 60 Ala He Ala Ser Glu Leu Gly Lys 

370 

ATG GGG GCC gTC ACC GAA TTt r- 

Met Gly Ala Lys Val Thr HI GAA ATT CAA GGG 

3 ^'"^^ Asp Gly Leu Glu He Gin Gly 

390 

'^GA AGC CCG 7TA CAA GGG GCC GAG r-r 

Gly ser Pro Leu Gin Gly Ala ctu vli f " ^^T CGC 

395 ^^1 Asp Leu Thr Asp His Arg 

405 

ATT GCC ATG GCG TTG GCG ^tc --r 

:ie Ala Met Ala Leu Ala I'e A^a a^^ 7^ ^"^^ "^^"^ ACA 
410 '^IV Ser Gly Gly Gin Thr 

"* ^ 420 



1012 
1060 
1108 
1155 
1204 
1252 
1300 
1348 
1396 
1444 
1492 
1540 



1894 
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430 ^ ° Phe Phe 

GGC ACG CTA GGG CAA PTT 

=1V Th. .eo 01. 01„ val S ""CTCC« 

ATGGACCGTA TCGTTCAAAC TGGGTAATTC 1 , c c 
TCCCTTTAAT TCCTTAAAAG CTCGCTTAAA ACTGCCCA.. ^ 

lAAA ACTGCCCAAC GTATCTCCGT AATCrrra^m 

GAGTAGAAGT ..ATGGGGCCA AACGGCGATC GCCACC.^ ^^^CGAGT 1815 

C^CGATC GCCACGGGAA ATTAAAGCCT GCATCACTGA .on 

CCACTTATAA CTTTCGGGA ^ATCACTGA 1875 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 447 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

'Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Met .la .e. Leu Ser .eu Asn Asn H.s Cln Ser H.s Gin Arc Leu Thr 

10 - ,5 

Val Asn Pro Pro A l rir> ,r 

ro Ala Gin Gly Val Ala Leu Thr Gly Arg Leu Arg Val , 

Pro Gly Asp Lvs Ser -"e q*^^ u; . , 

_ - r _e Ser H.s Arg Ala Leu Met Leu Glv Ala Jie 

40 ^5 

Ala Thr Gly Olu Thr lie ^^e Gla r'v, r - 

50 • Leu Gly Glu Asp Pro 

-50 

Arg ser Thr Ala Hxs Cys Phe Arg Ala Met Gly Ala Glu He Ser Glu 

" 30 

Leu Asn ser .lu Lys Zle lie Val Cln Gly Arg Gly Leu Gly Gin Leu 



30 55 
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^^^^ 

- - ^^^^ - 

- - .s. ^ 

140 

- o:„ 

Lys Phe Ala Pro Leu Ala Val ri 

val Gin Gly Ser Gin Leu Lys Pro 11^ His 

Tyr His Ser Pro He Ala Ser Ala Gin u.i r 

180 ""^^ ^J" Lys Ser Cys Leu Leu Leu 

190 

oxv .eu ... .K. c.u ex. ™. 

„u se. OZ. M,, ^ 
''■^^ 220 



Thr ne P„ X.. „,3 se. v,x ... P„ uis 

235 

TH. OX. cxn vax v,x VaX P„ cx. .sp XXe Se. Se. 

250 

.eu V.X .x, .e. . 

, 270 

Glu Asn Val Glv He Acn dv^ t<w 

ly lie Asn Pro Thr Arg Thr Gly Val Leu Glu Val Leu 

280 285 
Ala Gin Met Gly Ala Asd Jie T^- o^^ ^^ ^ 

300 

Cly Glu Pro val Ala Asp Leu Arg Val Arg Ala Ser His Leu Gin Gly 

:20 

cys Thr Phe Gly Oly Clu He He P.o Arg Le, He Asp Glu He =^o 

330 

::e Leu Ala Val Ala Ala Ala Phe Ala Glu Gly Thr Thr Arg He Glu 



345 



-•50 




-190- 

38.21(10660)A 

Asp Ala Ala Glu Leu Aro i r 

360 lU 

365 



- Se. .eu c:. .,3 Me. c:. ... ,,3 Va. ... P.e .sp ..p 

-V Leu Clu X.e Cln c:. c:. Se. P.o .eu CIn Cl^ ... 
Ser Leu Thr Asp His Arg iie Ala 



3 90 ^•'y AJ.a Glu Val Asp 

400 



405 ''^'^ Ala He Ala Ala Leu 

«10 415 

Gly Ser Gly Gly Gin Thr He He Asn a 

420 Ala Glu Ala Ala Ala He 

430 

Ser Tyr Pro Glu Phe Phe Glv Th^ r 

435 ^ J^f Gin Val Ala Gin Gly 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

iA) NAME /KEY: CDS 

(B) LOCATION: 107.. 1438 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
TTTAAAAACA ATGAGTTAAA AAATTATTTT TCTGGCAC..C GCGCTTTTTT TGCATTTTTT 60 
CTCCCATTTT TCCGGCACAA T..ACGTTGGT TTTATAAAAG GAAATG ATG ATG ACG 



Met Met Thr 



AAT ATA TGG C.nC ACC GCG CCC err -r— nr.r> ^-.^ 

Asn He Tro. His Thr ^'a -o ^ f '"'^ ^'^^ 

_ . -5 ...r .n.a .-ro .al ^^er Ala L^u Ser Gly Glu He Thr 



115 



16: 



-3 15 
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ATA TGC GGC GAT AAA TCA ATG Trr r., ^ 

- ... 3„ - s - - - ™ s s 



3S 



TTA OCA GAA GGA CAA Acr ri,» 
-u AU GXu Gl. ?S Z Z lll Tly 1^ 

Arg Gly p^e Leu Ala Cys Ala Asp 

50 

SI Z - C^J^ S - - ™ - - - A„ CAA 

55 "^^J Gly val Asp He Gin 



65 



AGA GAA AAA GAA ATA r^n , 

Ar. Glu Lys G^^ ^ ^ J ^^^^ ^TC GGA TTT CTG GGT TTG 

7 0 '^'^^ Gly Phe Leu Gly Leu 

^ 80 
CAG CCG CCG AAA GCA CCG TTA AAT ATr m ..^ 
Gin Pro Pro Lys Ala Pro Le^ Met G^^ f ^ f ^^"^ 

85 Tt ''^^ ^^'^ Gin Asn Ser Gly Thr Ser Met 

»u 95 

CGT TTA TTG GCA GGA ATT TTC rra rn^ ^ 

Arg Leu Leu Ala Gly ill leu IT. ''^^ ^^C GTG TTA 

100 ^ ''^^ Ala Gin Arg Phe Glu Ser Val Leu 

110 

TGC GGC GAT GAA TCA TTA raa 

CVS 0.. A3P z tit SI? =^j= - ji: ?n - 

CCG CTT GTG C.^:^ ATG GGG GCA AAA ATT GTC " r- r ^ --^ 

Pro Leu Vai C-ln Met G'v Ala Lv-. r 1 , 7 "-"^ -o^ .^-^T TTT ACG 

Aia i.ys Ixe Val £er His Ser Asn Phe Thr 
^ 140 

"a Pro IT. I ^11 IT f ^ f ° ACC GGC ATT GAT TAC GCG 

lie ber Gly Aro Pro Ton tu^ , _ 

150 '* Ala 

160 

TTA CCG CTT CCC AGC GCG CAA TTA AAA AGT TGC C^T ATT -G GCA GGA 

Leu Pro Leu --o S^r o^r, r r ^ .^ii ..u oCA GGA 

--0 ser ...a o.n Leu Lys Ser Cys Leu lie Leu Ala Gly 



170 -,5 



211 



259 



307 



355 



403 



451 



499 



547 



595 



643 



™ ^ :S ;S S SJI if =f coc „x 

130 -^c - ^^"^ "^^ ---'S Gly :ie Ser Arg 

':AC CAC ACG G.^ CGC ATG TTG CCG C'- ""T"^ r-- — 

ASP H.s T... . C.T =A0 .TC 
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AAG AAA GAG CAA ATA ATC GTr i^nn 

LVS Lys Glu Gin He He VaJ J'r cTy ct^ 

215 ^''^ "-^^ Gly cys 

225 

GTG CTT GAT ATT rrr r-r-o 

-1 z z z III s s -X 

230 P 5" Al« Al, AU Phe Phe Met Vsl 

240 

GCG GOT TTG ATT GCG CCG CGC GCC taa 

- - .e. „e p„ - - - CC. 

Z t?. Z IT, IT. S f^t ?s ™ ™ - - 

- - - - - - ™ 

280 ^ Ala Glu Pro Val 

290 

GCA GAT ATT GTT GTT TAT CAT TP a 2.^^ m,^^ 

M» .sp n. V. V. lit ^ ™ - - - 0^ c=c 

:z s= ?n r= r t 

310 ^ lie Phe Phe He 

320 

^la A^- ""^^ C-TG GGC ..t -^^q ^^A GAA 

«ia Axa rtla Cys Ala Glu Glv Thr '^hr- dk^ ■ 

3 25 ^-^^ -^sn leu Ser Glu 

■ . 335 

TTG CGT GTG AAA GAA TCG GAT CGT T-^A rrr rrr ^-^r^ 

Leu Ara Val Lvs Glu ^'er ^so Ar^ 1 ? ^ "^'^^ ""-^ "^^^ 

340 ' ^""^ '^^et Ala Gin Asn Leu 

s z -I - s« S 

0 ICC 

TAT GGA AGA AGC GAT CGG CAA T'^'" rrr- r--^ 

-r, ■ . - ^CG GCG CGG G G -AC Ar:T t^t^t 

-yr t..y Arg £er Asp Ara Gin Fhe ^ eu P-o Ar-^ ^ " 

" ° •"'•^^ '^^9 -c-L --^sn Ser ^he 

^30 .35 

ci5 Z ^ ?n r:= ^11 

:oo ' "^^ •■'^'^ •'••^^ C'-/ -'^1 Arg Ala 

-400 



787 



835 



931 



979 



1027 



1075 



1123 



1171 



1219 



1267 



1315 
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GCA GGT GAA TTA TTG ATT GAT GAr rrn ^ 

- 0. 0. - - 0. .eo oco c„ 

) 415 
CCG CAA TTT CGC GAT TTT Grr rr^n 

- - -p - - Z ^V. Z Z\ Z IT. 

435 

AAA GAT GCG AAA AAT TGT ri.n ^. 

Lys ASP Ala Lys Asn Cys hTs 2p AGCGGTGTTG GAAAAGGCAC 1455 

440 



GGTGGCGCAA GCTT 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 443 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

Met Met Thr Asn He Trn Hi<; Thr- »i „ 

^ irp His Thr Ala Pro Val Ser Ala Leu Ser Gly 

10 .5 



Glu He Thr He Cvs Glv 

-s Arg Ala Leu Leu 



V Asp Lys Ser Met Ser Hi 

" 30 

Leu Ala Ala Leu Ala Glu Glv Gin Th,- ^i T^ 

3 5 ^ ^^"^ Arg Gly Phe Leu Ala 

40 45 

Cys Ala Asp Cvs Leu Ala a,-^ ■ 

50 ' ^""^ "^^^ L-^" Arg Ala Leu Gly Val 

60 

Asp lie Gin Arg Glu Lys Glu He Val -hr - ^ a ^, 
55 "^^^ 

30 

Leu Gly Leu Glr. Pro Pro 



■s Aia Pro Leu Asn Met Gin Asn Ser G] 



90 ,5 

:r Ser Met Ara Leu ■ 

-v-y ^eu r^xd Gxv iie Leu aia a i c» i > 

' ■•^0 ' "^^^ Glu 

110 



1479 
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- LI - - - - - 

- - - 3. J 

ASP Tyr Al, :,eu Pro Leu Pro Ser AU Gin t. 

"5 "J Cx, Uu n, 

175 



200 Ala 

2 05 



Leu Ala Gly Leu Leu Ala Asp Gly Thr Thr a 

180 ^ Thr Arg Leu His Thr C^s Gly 

lie Ser Ar. Asp His Thr Glu Ar. Mec Leu Pro Leu 

200 

- -J ne ... „^ ^^^^ 

Hi. 01. V. A.P 

.1 A. AU ne AU Pro Ar, AU =1. v. v.l He A^ 

"^^^ 255 
Asn Val Gly He Asn Pro Thr Aro A^. ^^ 

260 ''''^ He Thr Leu Leu Gin 

270 

-ys Met Glv Glv Ara n,, r 

- rg ..e Glu Leu Hxs H.s Gin Arg Phe Trp Gly Ala 



285 



Glu Pro Val Ala Asp lie Val Val Tv,. u- - 

290 Ser Lys Leu Arg Gly He 

'^^^ 300 



Thr Val Ala Pro Glu Tro He Al^ i = r, 

305 3 ^® ..e Asp Glu Leu Pro II 

?he Phe He Ala Ala Ala Cys Al 



15 3^0 



325 « ^ •■^'^ V-1 Gly Asn 

335 

Leu Ser Glu Leu Ara Val •■•<= r'., - 

- .,s G.a ^er Asp Arg Leu Ala Ala Met Ala 

-■■'5 3 50 

Oin Asn Leu Gin "^hr- r -t. , 

-hr ..y val Ala Cys Asp Val Gly Ala Asp Phe 

355 



f 
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"0 V - - V. 

380 



Asn Ser Phe Gly Asp His Arg n 

390 



e/Ala Met Ser Leu Ala 
395 



Val Ala Gly 

V- 2° 
- s„ --^^^^.^.^ ne r 



420 



val Gly Glu Lys Asp Al/ Lys Asn Cys His Asp 

/ 440 



430 



